SUPPLEMENTO 


AL VOLUME XX, SERIE X; DEL 


NUOVO CIMENTO 


A CURA DELLA SOCIETA ITALIANA DI FISICA 


1961 20 Trimestre NPA 


An Introduction to the General Theory of Relativity. 


F. R. TANGHERLINI (*) 


Scuola di Perfezionamento in Fisica Teorica e Nucleare - Napoli 


(ricevuto il 5 Agosto 1960) 


CONTENTS. — I. Special relativity with a non-diagonal line element. 
11. How can we measure the velocity of light? 1°2. Synchronization 
methods. 1°3. The line element. — II. Rotating coordinate systems. — 


III. Prineiple of equivalence. - IV. Tensor analysis and Riemannian geo- 
metry. 41. The requirement of general covariance. 4°2. Covariant nota- 
tion. 4°3. Tensor densities. 4°4. Covariant differentiation; parallel displa- 
cement. 45. The equations of motion of a particle; geodesic paths. 
‘46. Riemann curvature tensor. 47. Symmetry properties of the Riemann 
Tensor and Bianchi identies. 48. Contracted curvature tensor, the Ricci 
tensor. — V. The field equations and Schwarzschild’s solution; Equations 
of motion in the Schwarzschild field. 5°1. The Einstein tensor. 5°2. Ein- 
stein’s field equations. 5°3. The Schwarzschild field. 5°4. Orbital equa- 
tions in a Schwarzschild field. 5°5. The gravitational deflection of light. 
5°6. Some orbital formulae common to Newtonian mechanics and general 
relativity. — VI. Mach’s principle and cosmology. 61. The clock paradox 
in the presence of a permanent gravitational field. 6°2. Mach’s principle. 
6°3. Olber’s paradox and the finiteness of the Universe. 6°4. Various 
cosmological models. 6°5. Summary of general relativistic cosmological 
models. 66. Calculation of red-shift from line element for model Universe. — 
VII. Supplementary remarks. — List of references. 


The purpose of these lectures is to present a brief but comprehensive treat- 
ment of the basic physical principles and mathematical techniques employed 
in the general theory of relativity. 


(*) Present address: Departement of Physics, Duke University, Durham, North 
Carolina. 
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We shall assume familiarity with the concepts and techniques of special 
relativity, although for pedagogical purposes we shall review some of the fun- 
damental ideas of special relativity in a way that will serve as an introduction 
to working with metric tensors different from the diagonal metric tensor used 
in special relativity. 

Frequently the question is raised whether the general theory of relativity 
is really necessary to handle the problems of gravitation, that is, could not 
these problems be solved using the «flat » space-time structure of special rela- 
tivity and interactions analogous to the electromagnetic field? As we shall show 
more fully in the text, one encounters serious difficulties if one attempts to 
do this. This is because the electromagnetic field intensities #, transform as 
a tensor, so that we cannot by a co-ordinate transformation, transform away 
the «intensities » of the electromagnetic field, although to be sure, we can 
transform to zero some of the components of the F,, (in general, at a point, 
how many can we transform to zero?). On the other hand, as is well known, 
we can transform away locally the intensities of the gravitational field by 
« freely falling » in it. Hence the components of the gravitational « intensity » 
do not form a tensor; clearly we need to introduce some new mathematical 
notions to handle this situation, such as are involved in the Christoffel symbols 
and covariant differentiation. Thus, although we shall show by explicit cal- 
culation that a special relativity type of theory involving a scalar field, does 
not give the correct answer for the precession of the perihelion of Mercury, 
we would like to make it clear that this was to be expected from the beginning. 

Needless to say, one can try to make more complicated theories based on 
special relativity, and some of these theories apparently give the correct answer 
to the famous three checks of general relativity. However, these theories inva- 
riably do not make it clear why the gravitational intensities can be transformed 
away locally, whereas, once one understands the formalism of general rela- 
tivity the result is obvious. 

Having transformed away the intensities of the gravitational field locally, 
the laws of special relativity once again are valid. The situation is very ana- 
logous to that occurring in Riemannian geometry. For example, consider a 
triangle on the surface of a sphere (say the earth), we know that for sufficiently 
small triangles Pythagoras’ rule holds, but for large triangles this is not the 
case: the geometry is non-Euclidean. As we shall see general relativity incor- 
porates this idea in such a way that it becomes a cardinal feature of the theory, 
so that, in general, the geometry will be non-Euclidean, and hence the paths 
followed by light rays will not be Buclidean straight lines, but their general- 
ization, the so-called null-geodesics. It is mathematically for this reason that 
light rays are deflected by the presence of a gravitational field, for example, 
light rays passing near the sun. From a physical standpoint we would say 
that a light ray, since it possesses energy H=hy, by the principle of equi- 
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valence also has mass m= H/c? = hy/c? and hence «falls » in the gravitational 
field, or is attracted by it and this gives rise to the deflection. However, a 
straightforward application of Newton’s laws only gives } the correct answer. 
The solution is provided by general relativity which also takes into account 
the non-Euclidean structure of the space and provides the additional deflection. 

Needless to say, once again, one might try to maintain special relativity 
and argue that the light rays get scattered by the gravitational field; for 
example, as they do in quantum electrodynamics by a Coulomb field (Del- 
brück scattering). However, this type of process is extremely complicated 
and is essentially non-linear so that we do not simplify matters by proceeding 
in this way. Moreover, from a logical standpoint it would not be satisfactory 
—because one would still have the line element of special relativity which 
implies the null geodesics of the space are those traversed by light rays, which 
would no longer be the case in this type of process. 

It is hoped the above discussion makes it clear that the underlying logical 
construction of the general theory arises to fit the peculiar needs posed by 
the problems involved in gravitation, and to do so in as systematic and simple 
a way as possible. 

No pretext is made that the theory is in any sense final. The present lack 
of a unified field theory makes it clear that it cannot be, and indeed it was 
towards rectifying this lacuna that Einstein devoted nearly forty years of 
his life. 

The enormous difficulties posed by this problem, together with the rela- 
tively few checks of general relativity and the growth of more fertile fields 
of investigation such as atomic and nuclear physics, elementary particles, ete., 
led to a decline in interest in the general theory for many years. However, 
at the present time the theory is meeting renewed interest: this is partly 
due to the development of new experimental techniques and also the conjecture 
on the part of some theoreticians that the fundamental difficulties that confront 
quantum field theory may find their resolution in suitably combining the two 
disciplines. However, since the gravitational potential energy between two 
electrons 10-15 em apart is Gm?/(10- cem) & 10-* MeV, it is clear that only 
a subtle application of the theory will suffice to give effects at this distance 
comparable with those of other forces, for examples, the electromagnetic po- 
tential energy for the same situation is & 0.5 MeV. 

An important application of the general theory is in the field of cosmology. 
This follows from the fact that the overall space-time structure is intimately 
related to the gravitational field via the metric tensor. Moreover, according 
to Mach’s principle, the distant masses of the universe are responsible for the 
inertial effects experienced locally. So that a general theory of relativity must 
of necessity have a great deal to say about cosmology. We shall try to describe 
some of these problems here as well as a few simple cosmological models. 
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CHAPTER I. 


SPECIAL RELATIVITY WITH A NON-DIAGONAL LINE ELEMENT 


11. - How can we measure the velocity of light? 


As a method of gaining familiarity with metric tensors different from the 
one employed in special relativity, 7,, = diagonal tensor (1, —1, —1, —1), we 
shall present an alternative derivation of the Lorentz transformation, which 
will also serve as a review of the basic ideas of special relativity and an intro- 
duction to the concepts and techniques of the general theory. 

It is convenient to begin by asking ourselves the question, « How can one 
measure the velocity of light? » We may assume that there are objects called 
«clocks» and «rigid rods» available, and that we can measure a distance 
between two points A and B in our space, by simply sighting along the light 
path from A to B and along this path introduce a series of unit rods layed 
end to end, and thereby determine the distance from A to B. 

(Clearly in such a method we are assuming the straight line distance be- 
tween A and B lies along the light path. Another method would be to tie a 
wire or string to a fixed object at A and tie it also at B and tighten it, and 
then lay off our distance along this wire with our unit rods. Clearly this 
assumes we have sufficiently inert bodies at A and B to « anchor» our wire 
and also that in tightening the wire it will assume the least distance between A 
and B which we may define to be the straight line distance between A and B. 
We present these considerations to bring out the important point that con- 
cepts such as «distance» have no physical meaning unless we provide an 
operational definition of how one goes about determining them. For example 
the well-know definition of the meter as the length of a bar of platinum-iridium 
kept at Sevres, is not of much help unless we can transfer the equivalent of 
such a length to other places and also have a well-defined operation of ad- 
dition. Note that we can measure fractions of the meter using a «cut and 
try » process.) 

Assuming we have determined the distance from A to B by some such 
process, as above, and assuming we have two similar clocks one located at A 
and one located at B, how would we determine the speed of light from A to B? 
We may imagine a device which sends out a light signal from A and trips 
the clock at A, so that the time #, that the light signal left A is recorded, and 
also a similar device for receiving the light signal at B, so that the time t, of 
arrival is recorded. However, this information does not help us at all unless 
we know also the relative synchronization of the clock at B with the clock 
at A. How do we determine this synchronization? For, without knowing it, 
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it is quite impossible to measure the time of flight from A to B of the light 
signal or indeed any object. This constitutes one of the many fundamental 
observations made by EINSTEIN in formulating the special theory of rela- 
tivity, and it is basic to any discussion of relativity. 


1°2. — Synchronization methods. 


12.1. Out-and-back method. — Various methods for synchronization and 
determining the speed present themselves. One method would be to send the 
light signal from A to B where it is reflected by a mirror and returned to A, 
tripping the clock at the time ¢, upon leaving and the time i upon returning, 
the round trips speed of light v,, might then be defined as 


AB+ BA 
(1.1) ro vanto 


i 


where AB = distance from A to B, and since we shall assume our geometry 
is such that AB=BA, we have vg,=2AB/(t,—t,). However, from this re- 
sult, can we infer what was the speed of light from A to B, and from B to A? 
Clearly the method says nothing about how long it took light to travel from 
Ato B and it would be an assumption on our part that this time interval is 
given by At,,= AB/vgr. Nevertheless, this is the assumption that is made 
in special relativity, so that a clock at B is said to be in synchronization with 
a clock at A, if a light signal having left the clock at time t, arrives at B at a 
time +, given by 


(1.2) DM AB 


The quantity v,, is customarily denoted by «ec». 

Now, it is fairly clear, such an assumption if it holds in one reference frame 
which we denote by S and co-ordinates (t, x, y, 2), cannot also hold in another 
frame moving with speed v (as determined by clocks synchronized in the above 
manner) relative to it, without some dramatic changes in our customary views 
of space and time. 

Before discussing this, however, let us observe that another method of 
synchronization suggests itself. Thus why not simply imagine two clocks at A 
synchronized together and move one of them from A to B and compare its 
time with the clock at B? (We assume that the time of two clocks when they 
are in the neighborhood of a point can be compared with a delay as small as 
we please.) Then clock B synchronizes with the clock at A if it synchronizes 
with the clock that has been moved from A to B. But how do we know nothing 
happens to the clock when we move it? Let us perform the following «ex- 
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periment ». We set up our two clocks at A and our light signaling source, 
and also a receiver at B together with a clock at B. We synchronize B with A, 
by moving the clock from A to B with various speeds (still unknown) and 
then measure the time for light to travel between A and B using clocks which 
have been synchronized in this manner. Moreover, we may imagine that in 
one case we have walked with the clock from A to B, in another case, we have 
transported the clock from A to B with an airplane, and on still a third occa- 
sion we have «shot» the clock from A to B by means of rockets. Then we 
will find that in the limit as we transported our clock from A to B very slowly, 
the speed of light determined in the above fashion will approach e. Moreover, 
we shall find that this result holds in all inertial frames (*). 

Since the orientation of the points A and B was unspecified this speed ¢ 
so determined will clearly be independent of orientation. It also follows that 
the time for a light signal to go out from A to B and then return will be 2AB/e. 
So that we see synchronizing the clocks in this way gives us the familiar re- 
sults. In Sect. 1°3 we shall try to incorporate these ideas into a mathematical 
scheme, such that if they are true in one inertial frame S they are true in 
another inertial frame travelling with speed v relative to S. 


1°2.2. Instantaneous synchronization. — Finally, we should remark still an- 
other method of synchronization might have been considered above, namely: 
synchronization with instantaneous or faster-than-light signals. However, we 
know experimentally that the speed of light represents an upper limit to the 
speed with which a signal can be sent between two points. It is a fundamental 
feature of relativity that this should be the case, so that the speed of light 
plays the role of a limiting speed. Although of course, mathematically, there 
is no reason why we cannot consider and calculate effects with such a synchro- 
nization procedure. For example, one can show that such signals would lead 
to causality violations (A. EINSTEIN: Ann. d. Phys., 28, 371 (1907); see also 
C. MoLLER, p. 52, W. PAULI, p. 16) if they propagated in a Lorentz-invariant 
way. If they did not propagate in a Lorentz-invariant way, they would single 
out locally a privileged frame of reference (so-called absolute or ether frame); 
it would then be very difficult to understand why Lorentz invariance has 


proven to be so useful and experimentally verifiable over an enormous range 
of energies. 


(*) However, due to the earth’s gravitational field and rotation, a co-ordinate 
system attached to the earth does not constitute an inertial frame. Clearly a better 
approximation is a freely falling elevator. For simplicity we shall assume ‘we are in 
outer space where we can neglect permanent gravitational fields and that in fact our 
co-ordinate system extends to infinity. Our inertial frames will then constitute those 
frames in uniform motion relative to one another. 
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1°3. — The line element. 


Les us now assume we have synchronized clocks using (1°2.1) or (1°2.2) 
in an inertial frame S, so that the time it takes light to propagate a distance AB, 
is given by 


(1.3) At = ot Ao = o*(Ax? + Ay? + Ae2)}, 


where Az, Ay, Az are the difference in co-ordinates between A and B, and At, 
the time difference as measured by the clocks at A and B. Squaring (1.3) 
and writing it as 


(1.4) ce? At? — Av? — Ay?— Az? = 0, 
we have the quadratic form characterizing the propagation of light signals. 


Passing to the limit At + dt, Ax + dz, ete., and denoting the left-hand side 
of (1.4) by ds? we arrive at the usual expression for the line element 


(1.5) ds? = c? dt? — de? — dy? — de? . 


The quantity dr = ds/e is called the « proper time.» The expression (1.5) re- 
presents a kind of generalization of the Pythagorean formula; it describes the 
infinitesimal « distance » between two events. Note that there is zero dis- 
tance between the two events consisting of a light signal leaving the point A 
with co-ordinates æ,, y,, 2, and at time it, and arriving at B (t,, ©, Yz €) 
That is, for a light signal, ds?=0. We see also if a particle travels with 
velocity v from a point A to a point B, then since dv=v,dt, dy=v,dt etc., 
we have 


(1.6) dc) di, 


where 0°=0+0+v. Note that ds’Æ0 for such a path unless v=c. We 
see also that for v>e, ds becomes imaginary. 

It is now convenient to choose a system of units in which c=1, so that 
we measure all velocities in units v/c. 

As is well known if we now consider another frame S’ in motion relative 
to S with velocity v and look for a co-ordinate transformation connecting them 
which preserves the same form for the line element, we arrive at the Lorentz 
transformation. Let us suppose, however, that we did not perform the Lorentz 
transformation but a Galilean transformation, what form would ds? take? 
(We shall assume the velocity is in the x direction.) We find under the co- 
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ordinate transformation 
(it) 

the line element becomes 
(1.7) ds? = (1 — v?) dé? — 2v dé’ da’ — dx’? — dy'° — dz” 


and setting ds? —0, we have for the propagation time over the path Az’, 
Ay', A, 
vAa' 1 Aa” 


4 
(PERS è | Aa 12 | Ae”? 
ee) = 1— e (1— 0) [1 — 0° oh : 


in contrast with (1.3). We see from this expression that the propagation time 
of light is not independent of direction or of the velocity of the frame. There 
is therefore a fundamental asymmetry between the two frames S’ and S. In 
particular if we send a light signal around any closed path we have: 


1.2 1 +3 
(1.10) par ine an? 


since (v/(1—v?))fdx! vanishes around a closed path. We know, however, 
experimentally that no such effects depending on the velocity of an inertial 
frame relative to another inertial frame have ever been measured. It follows 
that the co-ordinate transformation above does not describe what one phy- 
sically measures in the frame S’. What assumptions were involved in the 
Galilean transformation? Clearly it assumes in addition to the linearity of 
the transformation (which is reasonable on physical grounds) that the rods 
and clocks in 8’ maintain the same length and interval as seen by an observer 
in S, and also two events that were simultaneous in S, 4e. At= 0 were si- 
multaneous in S’, At= 0 since At'= At. This latter assumption is reasonable 
from the standopoint of Newtonian mechanics, since we could in principle 
synchronize clocks instantaneously, there being no limiting velocity in New- 
tonian mechanics. On the other hand, it is untenable in relativity as the velo- 
city of light plays the role of a limiting velocity. 

To arrive therefore at the «correct » transformation we postulate the fol- 
lowing which is fundamental to general relativity: 


da” 
(l= 7) 


ba 
a dy dz" |", 


1) A clock at rest in the primed frame does not read co-ordinate time t', 
but the proper time (interval) 


(1.11) ds == (l=72) dre 
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It follows from (1.11) that our transformation should therefore introduce a 
new time given by 


ct?) T = (1—v?)Ft' = (1 — v2) t . 


The line element now takes the form 


2v da'dT 
518 92 = dT? 12 12 ME 
(1.13) ds? = d We de'°— dy’ — dz”, 
and (1.9) becomes 
» Na’ I ee 
‘ee, AT — (1 — v2)} 1; + Ay + Ae]. 


. We next postulate: 


2) If we measure distance by sending a light signal out and back over 
a given spatial interval, the elapsed proper time is simply proportional to the 
euclidean distance independently of the motion of the frame. Then we must have 
AT=2 Ao’ and this will be the case provided in (1.14) we assume that the 
length of a rod in 8’ is given by 


(1.15) ADE =) 


We have therefore by the above assumptions the following co-ordinate trans- 
formation 


Il 


X = y(a— vi) VA 


(1.16) Deere 
yaa Gel 


and under this co-ordinate transformation the line element takes the form 
17) ds? = 47? — 20dX dT — (1 — vw?) AX2— AY? — dZ?. 


Now this line element and the corresponding co-ordinate system have a great 
many properties of the Lorentz transformation, with the exception that two 
events which are simultaneous in S(f,x,y,2) are also simultaneous in 
S'(T, X, Y,Z), since AT=y-1At=0 if At=0. In particular if we send light 
around any closed path, the time it takes is simply proportional to the length 
of the path, since 


(1.18) AT = vAX + Ac, 
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where Ao = (AX?+AY?+ AZ?) and 


(1.19) par= prari Das da 


since fu AX =0. 

However, the clocks have not been synchronized in the appropriate way 
but rather they have been synchronized absolutely. As a consequence the 
time taken for light to travel a distance AX from A to B is different than 
that taken from B to A. In fact we see that the time taken from A to B is 


2 Sh = 
(1.20) tto; 
and on the return trip 

2 Sai 1—% 
(1.21) mer 


But the average for the two trips out and back is 


(1.22) alae (out) + = (back)) = ike 
or restoring units e_!, and similarly for the other directions. 

However, as we remarked earlier, such a synchronization cannot be achieved 
without either employing instantaneous signals or synchronizing the clocks 
with respect to another inertial frame for which there is no justification. We 
therefore now demand that the clocks be synchronized by slowly moving them, 
the meaning of which will now be made mathematically precise. 

Consider the expression (1.17) for the proper time: We imagine a clock 
to undergo a displacement AX with AY=AZ=0, and in a time AT, then 


(1.23) As = (AT? — 20 AT AX — (1 — 02) AX?)# 
or expanding and keeping only first order displacements, 
(1.24) As = AT—oAX. 


In other words the proper time of the clock changes by an amount, —v AX, 
in addition to change in co-ordinate time. If we therefore introduce a new 
co-ordinate 


(1.25) PÉTER e, 
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this co-ordinate will incorporate the change in proper time and hence for very 
slow displacements AY < AT we have 


(1.26) ds = dt 


FRG 


If we now substitute for 7°, the co-ordinate #, according to the transforma- 
tion (1.25) in (1.17), and introduce 0, =X, y,= Y, 2,=Z, we have 


(1.27) ds? = dt 


Which is once again the special relativity line element. Moreover, we note 
from (1.25) that 


| t= T—oXx 
(1.28) = prié — vy(x — vi) 
= y(t — vx) 


so that t, actually does correspond to the time co-ordinate introduced in spe- 
cial relativity. We see, however, by this treatment that the possibility of 
introducing such à local time (as it was so designated by LORENTZ) depends 
on the possibility of making à co-ordinate transformation such as (1.25). Some- 
times this is not possible as in the rotating co-ordinate frame which we shall 
discuss next. That is to say, we shall find that when we synchronize two clocks 
and slowly move one away, for example around the edge of a rotating dise, 
the clock upon returning to its original location will not read the same time 
as the clock which remained fixed, and moreover, the disagreement will de- 
pend upon the path of displacement. This is clearly not the case for (1.25) 
since 


(1:29) pat, = par since prax =) 


As a consequence of not being able to satisfy a relation such as (1.29) 
it will not be possible to bring the line element in the rotating frame into the 
diagonal form of special relativity, without eliminating the rotation. 

Finally we should note that the usual results of special relativity can be 
obtained from the line element (1.17) and co-ordinate transformation (1.16), 
as we have already shown for the problem of sending light signals out and back, 
provided we always take into account how measurements are performed. It 
is perhaps interesting to see in this manner how general relativity permits us 
to consider more complicated and varied line elements than are encountered 
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in the usual formulations of special relativity. The fundamental rule being 
that in a sufficiently small region of space and time we can, however, always 
«renormalize » the length of our rods, the interval of clocks, and adjust their 
synchronization so that the line element of special relativity holds—although 
in general we cannot do this for finite regions of the space-time manifold. 


CHAPTER II. 


ROTATING CO-ORDINATE SYSTEMS 


In the preceding section we considered a transformation which did not 
leave the line element invariant but introduced off-diagonal terms, in par- 
ticular —2vdX dT. In a rotating co-ordinate system we shall also see such 
terms appear, which cannot be eliminated as in the case of an inertial frame. 

Apart from physical applications, rotating co-ordinate systems play an 
important role in general relativity for two reasons: first, in connection with 
the problem of the relativity of motion, that is, «does the phenomena in a 
rotating frame imply that rotation is absolute? » as maintained by NEWTON 
(the famous ice bucket experiment), «or relative? » as maintained by many 
philosophers contemporary with NEWTON, and most vigorously in the 19-th cen- 
tury by MAcH—we defer a discussion of this very fundamental and difficult 
question to the last section. The second reason is that the spatial geometry 
cannot be Euclidean and this provides one of the basic arguments in general 
relativity for considering geometries other than Euclidean ones. The reason 
for this non-Huclidean structure may be seen from the following very simple 
argument given by EINSTEIN. Consider a rotating disc with radius À measured 
in the inertial frame S relative to which the dise is rotating. Now consider 
a small rigid rod located on the circumference of the disc. Relative to S the 

rod appears to be contracted because of the Lorentz contraction. Hence the 
number of rods one can place around the circumference will be greater than 
if the disc were at rest. On the other hand, a rod placed along the dise will 
experience no contraction since it is translating in a direction perpendicular 
to its orientation. Hence the circumference ©’ in the rotating frame will satisfy 
the inequality OC’ > 27 and consequently the geometry is non-Euclidean. Be- 
cause of this non-Euclidean character it is not possible to find a Spatial co- 
ordinate transformation relating the geometry of the rotating disc to the geo- 
metry in the non-rotating system. In particular if the Spatial distance in S’ 
is given by do’ = f(dw', dy’, de’) while that in the non-rotating frame is 


À 


do? = dæ?+ dy?+ dz? then there does not exist a co-ordinate transformation 
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ahi 2 mi PAS, : A ee cee a 
= fa(0,Y, 2) à Y = fix, y, 2), a= fat, y, 2) 


which enables us to transform the line element do? into do”. 

Nevertheless, we can arrive at do’ by a co-ordinate transformation on ds 
in the following way. First of all let us write the line element in cylindrical 
co-ordinates, the axis of rotation being along z; then we have with 


o= (wit y)! t=t, 


& 


x 
p= tg, 
Yy 


2 


(2.2) ds? = dt? — do? — 0? dy? — dz?. 


To transform to the rotating co-ordinate system we employ the following 
transformation 


Gora e=@ 
(2.3) 


@ 


p=p—- at = 


The transformation in some respects resembles the Galilean transformation, 
and implies that the clocks in the rotating frame have been synchronized abso- 
lutely with those in the non-rotating frame. The reason for this type of co- 
ordinate transformation will become clear below. Then, under the above trans- 
formation, the line element takes the form 


(2.4) ds? = (1 — 9m?) dt’? — 20" dg’ dt — de” — 9 dp” — de”, 

from which we see that the proper time of a clock at rest in the rotating frame 
is given by 

(2.5) ds = (1 — 0’*w*)? dt, 


as we would expect from special relativity. That is to say, a clock with 
radius arm o’ is travelling with speed o'w, and (2.5) expresses the apparent 


LR 


slowing down of the clock as seen by an observer in the non-rotating frame. 
However, we cannot as in the case of uniform motion introduce à time given 


(2.6) ve =|a- wo) dt’, 
since the right-hand side cannot be integrated, 9° and ¢’ being independent 
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co-ordinates, alternatively stated, d7=(1—w*o”)' dt’ does not define a total 
differential. To be sure we could define a time given by 


20 IONE 


but then d7 #(1— 0)? dt’ and the line element becomes very complicated. 
We therefore stay with (2.3) but realizing that #' is not the time read by a clock 
at rest in the rotating frame, and rather we must use (2.5). Note that according 
to (2.5) all the clocks at a given distance o' from the axis of rotation have the 
same proper time interval as we would expect on physical grounds. Also, a 
disc cannot rotate with speeds o'@>1 since the proper time becomes ima- 
ginary, indeed even the case 0'©= 1 is forbidden since a mass particle located 
at this distance would, from special relativity, possess infinite energy. 

On the other hand, this limitation on rotational velocities would seem to 
conflict with simple astronomical data. Thus, let our co-ordinate system be 
fixed with the earth, so that the stars are now to be regarded as in rotation 
about the earth, and consider the star « Centauri A: its distance is of the order 
4.3 light years or about 4-101* cm, the earth’s angular velocity w is 7-10-5 rad/s, 
and hence the apparent velocity of the star is 104c. How is this possible? 

The solution lies in observing that a system of co-ordinates attached to 
a rotating frame does not constitute an inertial system and special relativity 
makes no statement about how fast a body may travel relative to a non- 
inertial frame, provided one cannot use these bodies to transmit signals with 
speeds greater than that of light. (In particular, one cannot use observations 
of the fixed stars to send telegrams with speeds exceeding that of light.) On 
the other hand the «fixed » stars themselves apart from their random motions 
do constitute the fundamental inertial frame, with respect to which we under- 
stand a body such as the earth to be in rotation. (Strictly speaking the effect of 
nearby masses such as the moon perturbs the local inertial frame according 
to general relativity. The effect, however, is fortunately very small.) 

We should therefore expect that seen in the rotating frame the proper time 
of the «fixed stars » is unchanged. To arrive at this we note that since the 
earth was taken as rotating with angular velocity w relative to the Stars, they 
consequently rotate with angular velocity —w relative to the earth, hence 
for the stars, substituting 


(2.8) dp = — wat’ 


(so that, note, they have a speed |o'(dg'/dt')|= oa > 1) in (2.4) we find along 
any trajectory do=0, dz= 


(2.9) GRR Ge 


and the proper time is unchanged as it should be. 
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Let us now consider the propagation of light signals in the rotating frame. 
Setting ds?=0, we find 


12. I 12 2 3 
(2.10) (1— w%o”)t at'— € Dea PNR BE aga | 
(1 — 00) (1 — 089°) 


The expression under the radical is the proper distance between two points 
(2.11) do? = (ao na, de) 1 


It is clearly non-Euclidean. In particular if we compute the circumference 
of a path at a distance R we have 


2a 
(1 — w?R?)?’ 


(2.12) do'= 


do'=dz'=0 


and we see, as followed from Einstein’s simple argument, the circumference ©” 
satisfies 
0’ 1 


Da les = > 
2aR (1— ©R2) dà 


Note that the radius is simply given by 


(2.14) do fav Re 


dp'=dz'=0 


in conformity with the fact that a rod oriented along the radius does not 
experience the Lorentz contraction. Once again we would like to stress the 
fact that it is not possible to find a transformation of the form 


P = f(0'; ®' 2), D — f2(0", P' a), Z = f,(0', pe), 


which eliminates the non-Euclidean structure of the element of distance do’. 
Although of course we can eliminate it by employing all four co-ordinates. 
However, one could easily imagine a situation in which even a co-ordinate 
transformation between the four co-ordinates would not suffice to bring the 
line-element into Euclidean form. When this is the case we have a « perma- 
nent » gravitational field, such as that of the earth, the sun, etc.; in these 
circumstances it is not possible to eliminate the «forces » of the gravitational 
field everywhere by a co-ordinate transformation, although we can always 
make space flat «locally » as we shall show later. 
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Let us now consider the following experimental situation. We are on the 
circumference of a rotating disc and by means of mirrors we manage to send 
light around the circumference in opposite directions. What is the difference 
in time taken by the two light paths? Employing (2.10) we see that around 
the light path in the direction of increasing g' we have: 


Rw: 27 “27 


(2.15) Oe = pe api 


and around the path in the opposite direction 


R2o:27 | R-27 


‘ a Mm R2\3 Af = = - 
(2.16) (1— w?h?)# At = ok)! 1 — wk) 


Hence subtracting and denoting (1—w?k?)* At! by AS’ the proper time on 
the circumference, we have 


An Ra 
(2.17) AS = Ag RER 
O © Gok 


Thus, apart from 2-nd order effects, the time difference is given by, restoring 
units, 


(2.18) 4x (area of path) xw/c?. 


This effect has actually been measured both for a rotating platforn, where it 
was possible to control ©, by SAGNAc in 1914, and for the earth by MICHELSON 
and GALE in Chicago in 1925. In the case of the Michelson-Gale experiment 
light was sent around a rectangle of approximately a kilometer on a side and 
the difference in time measured by an interferometer technique. The experi- 
ment may be looked upon as the optical analogue of the Foucault pendulum, 
whereby one determines the earth is in rotation by observing the precession 
of a pendulum without direct reference to the fixed stars. 

Let us now return to the problem of the slowly-moved clocks. Instead of 
sending light signals around the circumference, let us consider two clocks: 
one, fixed at the point A on the rotating dise, and the other synchronized with 
the clock at A and now slowly displaced around the circumference of radius À. 
Then the proper time of the slowly displaced clock is given by: 


Rw Ag’ 


2.19) AS = (1— w?R?)# At — re 
( | ai (1 — w2R?)t 


And hence after displacing by Ap'= 2a the clock upon returning to A differs 


o 
o 
Lei 


ee 
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from the reading of the clock at A by (restoring units). 


STEP ra 
2rkR?wc 
= Spe 


(2.20) ASTE Aia 
(1 — wR?) 


O 


So that the Michelson-Gale experiment could equally as well have been 
performed using slowly-moved clocks. For example, if we take a path around 
the equator R=6.4-10%km, neglecting the 2-nd order term, we have 
AS — AS, ~ 10-78, which could easily be measured, although because of 


O 


the information we now have from other experiments, it would only serve 
as a further check of the basic ideas. 


CHAPTER III. 


THE PRINCIPLE OF EQUIVALENCE 


Perhaps one of the most fundamental questions one can ask concerning 
the general theory of relativity is, « Why does the theory concern itself so 
much with gravitation? » We have given a partial answer to this in the Pre- 
face, but we now wish to develop this point more fully. 

As is well known in Newtonian mechanics and gravitational theory, the 
acceleration imparted to a particle by a gravitational field is independent of 
the mass of the particle. We have, for example, in a uniform field in the nega- 
tive z-direction, for a particle of mass m freely falling in it, 


ORE 
(3.1) M — = — MY, 
dt? 
| d?a d?y 
(3.2) Mm o Mm a (0) — 


d?8 
dt? 


(3.3) SII 


The curious thing about this equation is that the mass does not enter in it be- 
cause of the equality of gravitational and inertial mass. This is quite dif- 
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ferent than the case of the Lorentz force: for a particle of charge e and mass m 
in a uniform electrostatic field &,, the equations are (neglecting relativistic 
corrections) 

dz 


dt? 


de 4, 2 _ 


(3.4) Mes Ne m den 02 m ter: 


We see the acceleration imparted to the charge depends on the ratio e/m. As 
a consequence if we have a collection of charged particles of various e/m ratios, 
even neglecting their mutual interactions, they will show a tendency to sepa- 
rate in the presence of the uniform field &,. This forms the basis of a simple 
e/m spectrometer. On the other hand, if we drop from a given height at the 
same time, a number of particles having different masses m,, m,, ms, ... etc., 
they will strike the earth at the same time, showing no tendency to sepa- 
rate. As a consequence if we adopt a system of reference accelerated down- 
ward with an acceleration g, the particles appear at rest in this frame, and 
do not appear to be acted on by any forces of the gravitational field. 
Consider now a rocket ship in free space not under power; it obviously 
constitutes an inertial frame. A particle of mass m in the rocket ship there- 


fore satisfies the equations 


d?22 
(3.5) m de = () , etc. . 


Let the rocket be turned on, so that the rocket ship undergoes a uniform ac- 
celeration in the z direction, corresponding to the co-ordinate transformation 


(3.6) = 2 bat 
and hence 

3 0224 nd 
LE, dt? dt 


Substituting (3.7) in (3.5) we have 
(3.8) i == = mu or os 


Thus the acceleration of the particle in the rocket ship would be independent 
of its mass as in the gravitational field. Moreover, if we imagine the particle 
attached to a spring scale it would appear to have acquired a weight ma just 
as though the rocket ship were at rest on the surface of the earth and the par- 
ticle was being «pulled down » by the gravitational attraction. (Indeed it is 


(9) 
© 
m 
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this property of inertia which has led to the suggestion of introducing an 
effective gravity into artificial satellites, e.g. by rotating them.) 

We see therefore that because of the principle of equivalence there is a 
kind of relativity of acceleration. An observer whose accelerometer says he 
is undergoing a uniform acceleration cannot distinguish whether he is 


a) in free space accelerating relative to the « fixed stars », 
b) at rest on the surface of a gravitating body. 


And a similarly when his accelerometer reads zero, he cannot distinguish 
whether he is 


a) uniformly translating in outer space relative to the stars, 


b) freely falling in the earth’s gravitational field and hence undergoing 
an accelerated motion relative to the stars. 


These important results follow from Newton’s laws of motion and the equi- 
valence of inertial mass and gravitational mass. 

However, if we now consider special relativistic effects, since the energy 
of a body increases when it is in motion relative to a given frame, and hence 
its inertial mass (from the equivalence of inertial mass and energy via the 
famous relation m= Æ{/c?), it would follow that unless the gravitational mass 
-also increased, we would no longer have the principle of equivalence holding 
exactly. (It is an interesting feature of Einstein’s approach that he wished 
to keep this very fundamental property of the Newtonian theory, just as 
in developing special relativity he kept the relativity of uniform motion.) 

This distinction would lead to an inconsistency with the conservation of 
energy which we may arrive at by the following argument. 

From special relativity, if a system’s energy changes by AZ its inertial 
mass also changes by Am=AE/c?. Hence if we imagine a nucleus at rest 
in the gravitational field with mass M,, and consequently gravitational mass Mo, 
to undergo a transition in which it emits a photon of energy # to a final state 
M,=M,—H# (neglecting recoil, diminution of recoil is of importance in con- 
nection with the Mossbauer effect and experimental verification of the prin- 
ciple of equivalence), it will have after the transition a gravitational mass W, 
as may be verified experimentally. We therefore expect that the gravitational 
mass of a body will depend on its energy content. Now since the emitted 
photon has an energy E= hy, it should therefore have a gravitational mass 
m = hv/e?, and therefore upon «falling» in a uniform gravitational field of 
strength g, acquire an additional energy (hy/c)gL, where L is the distance 
it «falls». Its final energy will therefore be 


h gL 
(3.9) hy + a gL = hy (1 +5 | 


e? 
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If this were not the case we would arrive at the following contradiction with 
the law of conservation of energy. Let us imagine the above nucleus in the 


initial state M, to fall a distance L before decaying, then it will acquire an 
additional energy M,gLe? and its total energy before decaying will be 


(3.10) E, = Mc? + MygLe? . 


Now consider the energy obtained if the nucleus first decays and then falls, 
the radiation will also be assumed to be received at distance L below its point 
of emission. If the photon did not acquire an additional energy given by (3.9), 
the total energy received would be 


(3.11) E, = M,c?+ M,gLe? + hr. 
Subtracting (3.11) from (3.10) and using hy =(M,— M,)c? we have 
(3.12) LH, — EH, = (M,— M,)gLe? . 


On the other hand, if we take the photon’s energy to be (3.9), so that instead 
of (3.11) we have 


15 
(3.13) Bi= My MgE + ho (1 + Zi 


and we have E,-E,=0, so that energy is conserved. 

We shall now show how we may obtain (3.9) from an elementary application 
of the Doppler effect and the possibility of replacing a system of co-ordinates 
at rest in a uniform gravitational field by one in uniform acceleration. Con- 
sider two observers A and 5 at rest in a uniform gravitational field of mag- 
nitude g with B located vertically above A at distance L. Now let a photon 
of frequency v be emitted at B and absorbed at A. What is the frequency of 
the photon measured at A and what is its energy? 

According to our basic principle discussed previously the system is equi- : 
valent to one in which there is no gravitational field present, but the frame 
in which A and B are at rest is undergoing a uniform acceleration in the posi- 
tive z-direction. Now consider a photon emitted from B with frequency figs 
then in travelling from B to A, the observer A has in the meantime experienced 
an acceleration lasting a time t,,= L/c (approximately), and has acquired a 
velocity V— gL/e and hence by the Doppler effect the frequency of the photon 


i=) 
= 
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seen by the observer at A will not be v,, but 


610 ra) Je) 


and hence the energy will be £, = hr, = E,(1+(gL/c?)) as in (3.9). Thus the 
photon will have acquired energy as a consequence of its fall in the gravitational 
field, and similarly on rising it will lose energy. 

This result will enable us to make a crude estimate of the deflection of 
light in a gravitational field; however, before doing this let us call attention 
to an important consequence of (3.14). 

If we imagine an oscillator sending out a wave train from B to A, then 
in time At, the number of waves which has left B must be v, At,; this wave 
train will appear at A during time At,, and the number of waves will be 
y,At,. Since the total number of waves is clearly the same, we must have 


(3.15) vy, At, =v, At, 


TABLE I. — Red-shift of 0-type stars in stars clusters. 
«These stars may have masses of the order of 100 solar masses and radii about 4-5 
times that of the sun. The gravitational red-shift would thus be about 21 times 
greater than on the sun or near 10 km/s which is about the average value observed » 
(from R. J. TRUMPLER). 


Cluster Star M SDs Red-Shift (km/s) 
J.C. 1805 2 == 4.5 07 1942.2 SB. | 
3 — 4,9 07 INA RO 929.0) 
J.C. 1848 ] _ 6.0 07 +10.4+4.8 S.B. 
la — 4.4 08 + 4.6-+5.4 
2 459 09 We =, 6.2ero 5 
NGC. 2244 9 — 5.0 07 sy 6 8-143 
15 RES LC 06 IS OT | 
8 | —43 08 | 6.4416 | 
NGC. 2264 1 AG 07 No Se 
NGC. 2353 | 10 09 16.1-£1.6 8.B. 
NGC. 6231 50 4.6 08 | +16.442.6 
NGC. 6604 1 6.010 08 | 13.6 +4:1: Var: Vel. 
NGC. 6611 1 ITA OI 06 eek Oe 
2 — 4.8 08 | + 4.1+3.7 Var. Vel. | 
3 RAT 08 + 9.9+2.4 | 
NGC. 6823 1 - 4.3 08 RMSELI 62234 | 
2 — 4.0 09 a St, py ea | 
NGC. 6871 5 Dip BO +15.6+1.6 
18 stars in 10 clusters + 9.8 km/s 
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and hence using (3.9), it follows 
L 
(3.16) AL, (i + to) Are 


From this we can conclude that if two clocks are at rest in a gravitational field, 
they will run at different rates depending on their location. In particular a 
clock at the surface of the earth will run more slowly than a clock located 
on a mountain. 

The expression (3.14) has been verified astronomically in the case of the 
sun and a variety of stars, although there is some experimental uncertainty 
in the result, the accompanying Table I taken from the talk of R. J. TRUMPLER 
(Jubilee of Relativity Theory, in Helv. Phys. Acta, Supplementum IV, Basel, 
1956) would seem to leave no doubt that the effect is definitely present and 
of the right order of magnitude. 

In addition to the above measurements, recently using the Méssbauer effect, 
the shift has been measured terrestrially most accurately by R. V. PounD 
and G. A. REBKA jr., Phys. Rev. Lett., 4, 337 (1960). Since the height Z of their 
apparatus was ~ 25m the frequency shift was of order 10-5 in comparison 
with the shift of order 3-10-5 in the above table. Thus the shift has been 
checked experimentally over a range of 101 in Ap/y. 

The above formulas are based on a uniform gravitational field; however, 
a natural generalization is to replace gL by AU, the difference in gravitational 
potential between B and A. Hence, if we choose the arbitrary constant in 
the potential such that it is zero at infinity, it follows from (3.16) that the 
time interval of a clock in a gravitational field in terms of the time interval 
At of a clock at infinity is 


(3.17) At, = (1 + =) At. 


That is we have At, ~ (1 — (gL/c?)) At, (our calculation is not valid to higher 
order terms). Now inserting the expression for the Newtonian gravitational 


potential, U,=—GM/r, where M is the mass of the body,we have 
(3.18) At, = (1 = a) ING 
rc? 


An expression we shall obtain later in a more rigorous way. 
Now it follows from the basic principie we employed earlier, that since 


a 
~ 
Del 


Le 
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At, represents the time read by a clock at rest at A, this must be the proper 
time, that is for a clock at rest 


(3.19) ds? = (1 - 
apart from higher order terms. Therefore to a first approximation we would 
expect 


2GM 
| c2 dt? — dx? — dy? — dz’, 


(3.20) ds? = (1 af 


re? 


and we see that the velocity of light, obtained from ds=0, is no longer «e » over 
finite regions. Because of this dependence of the velocity on the location, a ray 
of light passing near a body of mass M will be deflected. From a Newtonian 
standpoint, this can also be understood in the following way. We associate 
with a photon of frequency hy, a gravitational mass hy/c? as above, then on 
passing by the body of mass M it is attracted by the gravitational field, just 
as is a Newtonian particle. However, as we indicated earlier this picture is 
misleading, since the geodesic paths followed by a body, i.e. ô[ds = 0 do 
not represent paths for which the body «feels » a force. The error lies in 
extrapolating from the Euclidean geodesics of flat space, or regions where the 
gravitational field is negligibly small, to regions where the geodesics have 
become significantly non-Euclidean, and then regarding these departure from 
the Euclidean geodesics as being due to « gravitational forces ». 

To calculate the deflection of light we should therefore solve the equations 


| 5fas =e 05 
(3.21) | 


However, the exact method of solving such equations belongs to a later chapter; 
moreover, since our expression for ds is only approximate, such a solution is 
not necessary. Rather let us proceed in the following way. We have seen 
that a photon in a gravitational field behaves very much as though it were a 
Newtonian particle of mass m= hy/c? travelling with speed €, 2.e., a momen- 
tum hy/c. Let us therefore calculate the deflection experienced by a Newtonian 
particle travelling with speed c in a Newtonian gravitational field. It may 
be shown this gives the same answer as one obtains from (3.21), although strictly 
speaking of course, such a picture is untenable from the standpoint of Huyghen’s 
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principle. Indeed, the effect of the gravitational field is to increase the index 
of refraction in the region of space so that light gets deflected towards the 
gravitational field, and therefore the speed of light is less near the gravita- 


star al co 
apparent 
position 
deflection 
perihelion 


approx. hyper bolic 
light path 


OEarth 
le, Ile 


tional field than at great distances. On the other 
hand, a Newtonian particle as it approaches the 
sun, or gravitating bodies, increases its speed. It 
is for this reason that when EINSTEIN calculated 
the effect he was very careful to use Huyghen’s 
principle and not the method we shall present. 
below. The student should read for himself this 
very elegant and succinct derivation of the effect 
by EINSTEIN. 

The physical situation is given in the above 
picture. We imagine now a Newtonian particle of 
mass m moving in a plane with polar co-ordinates 7, 
g. The energy integral is 


ee LUE GME 
— 7? + — pg? — — = sai 


(3.22) + e 


? 


where 7, g denote radial and angular velocities, M 
is the mass of the deflecting body, £ is the energy, 


m the mass of the particle. The angular momentum integral is (Kepler’s law) 


(3.23) 


where A is the angular momentum. 
particle is at the perihelion rp ~ e. 


RE 
rep ==, 
Mm 


To estimate A, we note that when the 
We therefore set A = mRe and hence 


TPE Or 


Hence, we may eliminate time derivatives above using 


to write (3.22) as 


1 


D 


(3.25) 


D a ALU 

dona dai 
dr\?Re 1Re GM E 
| HOUR rm” 


and now introducing u=+-1, the above becomes 


(3.26) 


1 (du 
2\dq 


2 Jl 
| Re + = Reu? — GMu = 2 ; 
2 m 


4 


4 
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which upon differentiation yields 


du GM 
dg? — @ Re? 


Newton’s equation for planetary motion in polar co-ordinates for A = mRe. 
Now the solution to this equation consists of the solution to the homogeneous 
equation d°u/dp°?+u= 0, or the straight line r cosy = R 


i 
(3.28) U= 08, 
plus the inhomogeneous part w= GM /c?R? corresponding to pure circular mo- 
tion r= constant. The total solution is (very nearly) 


(3.29) u = PR cosy +GUJeR: , 


since R-1>>GM/c?R? the above is the equation of a hyperbola (for a planet 
the second term is much larger than the first corresponding to an ellipse). 
Now what we desire is the angular deviation of the two asymptotes to the 
hyperbola. Therefore since these asymptotes correspond to w= 0, we have 


R- cosy, + GM/eR? = 0, 


(3.30) 
ER cos 9: + GM/e?h? = 0. 


Now since 9, and ®, differ only by a small amount from + 2/2 and are 
clearly equal in magnitude, we have, using cos (4+(z/2))=—sin4 »—A 


(3.31) Aw GM/ek 
and the angular deflection from the lower asymptote, projected backwards, is 
(3.32) a= 24 R24M/cR . 


This result is too small by a factor of two, and to get the correct answer one 
must resort to the full machinery of the general theory. Although ScHIFF 
(American Journal of Physics, 28, 340 (1960)) has given some heuristic argu- 
ments recently using the principle of equivalence to a higher order than we 
have here to obtain the correct deflection without using the complete theory (*). 


(*) See also N. L. BaLazs: Zeits. Phys., 154, 264 (1959), and the work of W. Lenz 
(1944) described in A. SomMERFIELD: Lectures on Theoretical Physics, vol. III (New 
York, 1952). 
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However, it should be understood that once we adopt a line element dif- 
ferent from that of special relativity (i.e., one that cannot be obtained by a 
co-ordinate transformation from ds?— dt2?— dx?— dy?— dz*) we are no longer 
in the framework of Minkowskian geometry or of special relativity and one 
does not have an explanation of the phenomena unless one has a well-defined 
mathematical scheme for arriving at such a non-Euclidean line element. Thus 
it should be remembered that EINSTEIN used such elementary arguments as 
we have given above before he had constructed the general theory and as a 
gnide to what the theory should contain, so that such arguments should be 
looked upon as bearing about the same relation to the general theory as Bohr’s 
quantum rules bear to quantum mechanics. We therefore proceed to the 
construction of the full theory, which in addition to giving us the change in 
clock rate, deflection of light, will give us the precession of the perihelion of 
Mercury. 

Finally in connection with the experimental side of the principle of equi- 
valence, in addition to the above references, we have the famous work of the 
Hungarian physicist R. von EòTvòs (Ann. of Phys., 59, 354 (1896)) who de- 
monstrated experimentally the equivalence of inertial mass and gravitational 
mass to about one part in 108. This was a considerable improvement over 
the work of BESSEL (around 1830), based on considering the period of pen- 
dulum. For a simple pendulum one has from Newton’s laws T=2nV Ig, 
where / is the length of the pendulum, but if inertial mass and gravitational 
mass were not equal one would have T— 2x Vm,lm,g, where m,, m, are the 
inertial and gravitational masses respectively. Clearly as we change m, 
unless m, changes proportionately, the period will also change. By this method 
Bessel verified the principle of equivalence to a few parts in 10°. Recently 
R. H. Dicks of Princeton has been engaged in a program to improve on the 


results of EOTV6s and to see whether or not the principle is correct to a higher 
order of accuracy. 


CHAPTER IV. 


TENSOR CALCULUS AND RIEMANNIAN GEOMETRY 


41. — The requirement of general covariance. 


In the preceding sections we saw that the quadratic differential form or 
line element ds? provided us with a convenient tool for studying the propa- 
gation of light in different co-ordinate systems. In performing these co-ordi- 
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nate transformations and in writing down the line element no special notation 
was used, in order to avoid confusing mathematical notation with the physical 
ideas involved. However, although it is possible to continue in the preceding 
fashion, we enormously simplify the presentation by introducing covariant 
notation and the calculus of tensors. We shall not attempt to give a rigorous 
mathematical presentation of this discipline, but rather develop it side by 
side with the physical ideas. 

As you know the fundamental postulate of special relativity is: 

The laws of nature take the same form in each of two uniformly moving 
frames. 

As a consequence of this postulate one is restricted in special relativity to 
considering co-ordinate transformations which leave ds? form-invariant. These 
transformations constitute the group of Lorentz transformations, and in special 
relativity one is restricted to this group of transformations. However in gen- 
eral we are not merely confronted with uniformly moving frames, but frames 
in arbitrary states of motion, for which there is consequently no Lorentz trans- 
formation connecting the two frames. How are we to formulate the laws of 
nature in such frames? Moreover frequently we are interested in using other 
systems of spatial co-ordinates besides those which leave ds? form-invariant. 
For example in Chapter II, we introduced cylindrical co-ordinates for which 
the line element assumed the form 


(4.1) ds? = dt, — do? — 0? dy? — dz?. 


Such a transformation, by definition, is not a Lorentz transformation, never- 
theless we can certainly describe how light propagates in such a co-ordinate 
system. Indeed Maxwell’s equations written in vector form 


isl) ano ae a aT <= 041) 


(4.2) 

V-B—0 VXE + 0,B = 0 
make no reference whatsoever to the system of spatial co-ordinates employed. 
They hold in arbitrary systems of spatial co-ordinates. 

On the other hand we have learned from special relativity that time plays 
the role of an additional co-ordinate, so that we are no longer dealing with a, 
three dimensional manifold and an absolute time as in Newtonian mechanics, 
but a four dimensional manifold, the Euclidean element of spatial distance do 
and the element of time being combined to form a four dimensional element 
of distance 


(4.3) ds? = dé? — do? . 
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However because of the difference in sign of the way dt? and do? occur in ds?, 
it is clear that they do not form a 4-dimensional space in the Euclidean sense 
which would correspond to a line element of the form 


(4.4) ds? = di? + do? . 


This distinction is very important since in the latter case, the propagation of 
light would be described by dt*4 do? =0 and hence dt=0, do=0 in other 
words light would not propagate (*). On the other hand setting (4.3) equal to 
zero, we have At = Ao correspond to light propagating through a distance Ao 
in a time At. Apart from this distinction, however, we may look upon time 
as simply an additional co-ordinate that enters into the transformation along 
with the three spatial co-ordinates. 

We are therefore led by the above arguments to the conclusion that the 
laws of nature should be stated in a form that hold under arbitrary co-ordinate 
trasformations, treating time as a co-ordinate along with the spatial co-ordi- 
nate. (Postulate of general covariance.) The mathematical tool for accom- 
plishing this will be the covariant tensor calculus invented by HRICCI to 
handle precisely this kind of problem. 


42. — Covariant notation. 


As we have seen the line element under a co-ordinate transformation as- 
sumed a variety of different values for the coefficients of the df?, dtdx, dx?, 
dy”, de? ete. To describe this situation we write it in the following notation, 
and we sum over repeated indices designating the four co-ordinate differentials 
Dyeds! "(4 —0, 172,73) 


(4.5) ds? = g,,da" da”. 

The g,, is called the metric tensor: metric, because it provides the relation 
between the co-ordinate differentials which furnish the differential element of 
distance (**) between events, and tensor because of the way in which it behaves 
under co-ordinate transformations. The reason for writing subscripts below 
on the metric tensor, and above on the co-ordinate differentials also arises 


() We are of course excluding imaginary values for the co-ordinates. Alterna- 
tively if we inkowsk1’s trick i = 
ea = Minkowski'8 trick of setting xy=ît we exclude real values of »,. 
i DI of course possible to consider generalized spaces in which there is no 
element distane fi as is dc 7 i 
nt of distance defined, as is done by the topologists. Also, needless to say, 


although we have restricted ourselves to four co-ordinates the notation itself involves 
no such restriction. 
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because of the way in which they transform. As we have seen the 9,» are in 
general functions of the co-ordinates and not simply constants as in special 
relativity. 

On the other hand we have seen from our earlier discussion that in general 
what a clock or rod measures does not depend only on the co-ordinates but 
on the g,, as well. Indeed as we have seen the proper time of a clock at 
rest at a particular point will not be da, but 


(4.6) ds = V Goo (dx)? : 


It follows that we must restrict ourselves on physical grounds to co-ordinate 
transformations for which (*) 


(4.7) hyve 


Similarly upon setting ds*—0, we have for the proper time to propagate 
through Az*, 


Gi Ax? = — gog het + Vy Aa Agr’ , Gg =e) 
(4.8) _ GorYos 
y 


The y;; represent the metric for the element of spatial distance, as can be 
seen if we consider à situation in which 


;Na' 
(4.9) pe a 


4 
Yoo 


By our previous arguments a rod of co-ordinate differential Ax’ does not 
have a proper length Aw’, but a proper length 


(4.10) Vy, (Aa’)? . 


For example in cylindrical co-ordinates a rod located along AO, doesn’t have 
length A, but r AO. Since on physical grounds the proper distance between 
two points should be positive definite and locally reducible to the Euclidean 


(*) Sometimes in the literature one finds this requirement reversed i.e. goo< € ; this 
is because the authors define their ds? as the negative of our ds?. Also some authors 
have used « light-cone » co-ordinates in the discussion of gravitational waves; in these 
co-ordinates g,—0. The above restriction cannot be maintained everywhere in the 
gravitational field of a point mass for some co-ordinate systems. Thus the above 
restriction may have to be qualified with further development of the theory, or, alter- 
natively, it will serve to greatly restrict the class of allowed solutions, and energy- 
momentum tensors. 


> id 
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distance, the quadratic form satisfies 
(4.11) vi Ax’ Axi > 0 


unless all Aw‘. Hence the 3X3 matrix y,,; must be positive definite 


| | | Vil Vie V13 
IN Vie | k 
(4.12) re ME 00 EU 201 
| Var Vo 
Vai V32 Y33 


We are therefore restricted to co-ordinate systems for which (4.7) and (4.12) 
hold. For example if we try to transform time into a space co-ordinate and 
space co-ordinate into time (4.7) and (4.12) are violated. By the above method 
we therefore arrive at our systems of allowed co-ordinates, it being understood 
that, in general, co-ordinates do not have any physical significance but only 
certain « proper » measures of them such as in (4.6) and (4.10). 

Let us now adopt a g,, satisfying the above restrictions and perform a 
co-ordinate transformation, and see what form it takes under the transfor- 
mation. We have under a co-ordinate transformation from de” to da” 


dal 
(4.13) Cie = el da, 
and hence substituting in ds? 
da \ {da 
4.14 A0 | dc? 
( ) 8 I (o (ar da da 


So that the metric tensor in the primed frame is given by 


; Oxk\ | Om” 
(4.15) uv = fl (| Juv . 


Quantities which transform in this fashion are called covariant tensors of the 
second rank (two indices). We see also that since Inv = Ion it is a symmetric 
tensor. On the other hand, the da transforms from (4.13) as 


dg 
esa 
(4.16) Cp = | > dat . 


We say it transforms as a contravariant tensor of the first rank, or contra- 
variant vector. 


Remark. one should carefully distinguish between the transformation prop- 


i=) 
D 
Ge] 
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erties of the co-ordinate differentials and the co-ordinates themselves. Thus 


æ'* is not a tensor (even though our notation would imply this (*)) since 


(4.17) aes Ota eee) 


and this function is not in general the same as (4.16); only in the case of linear, 
homogeneous transformations are the two transformations the same, An ex- 
ample of a quantity transforming as a covariant tensor of the first rank is 
furnished by considering the gradient of a scalar function of the co-ordinates, 
say ®. Thus 


oo) da — da \ dg 


oD op | i) 
app ’ 
or denoting 0@/dx'" by Vi we have 
da! 


Dan 
(4.19) Ko (23) Ve: 


Any vector transforming as in (4.19) we say transforms as a covariant tensor 
of the first rank. 
° . rl . . 
Now consider the expression V'“V, under a co-ordinate transformation 


> Cu | CT EMI 
(4.20) LOUE (a) (a) Elena 
But clearly since 
| du _ , Ca dat 
(4.21) da’ = CT: GA = apie Oph dz’, 
we have 
do 00 ll, ve 
(4.22) on dar A) =0, 4A ? 


where 6; is the Krénecker-d symbol. It is a « mixed » tensor and is the same 
for all observers since 


0x'}\ | da” da'4 dal 
1A DRE II À 
15029) fesa (=| (ar) aes dae Que ee 


(*) Some authors attempt to avoid this by writing the co-ordinates with the sub- 
script below i.e. x}, but then since x} is not a covariant tensor this does not make 
for greater logical consistency than writing it above which greatly facilitates the 


notation. 
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Hence 


SA) VALE vw 0: VAT = eae . 


An expression such as V’V, is described as an invariant scalar product, it 
is a tensor of rank zero since denoting it by S 


(4.25) S= VV = VV, =S. 


We see from the above that tensors of higher or lower rank may be formed by 
multiplication of tensors (it being understood that the argument of the tensors 
in the product is taken at the same point), for example, we can form the 
« mixed » tensor 


(4.26) DRE ASP OLDS ia Ee (Ae 


So far we have not directly related covariant and contravariant indices. We 
define for a contravariant vector A’, the covariant vector given by 


(4.27) dig 4% 


Then clearly A“A4,= g,,4"A” in analogy with the line element ds?=g,,dx"dæ”. 
Similarly given a second rank contravariant tensor 7", we may define 


(4.28) Ii ip de 


and so on. As may be easily verified A,, 7,, defined in this way transform 
as covariant tensors of the first and second ranks, and moreover AAT ee 
are invariant tensors. 

As may be easily shown from our assumptions (4.7) and (4.12) the deter- 
minant of g,, cannot vanish (since det Iv= 9 =— Joy and Joo, y > 0) (*) hence 
we may define an inverse to g,,, the contravariant metric tensor g which 
satisfies 


(4.29) G29, =D 


v 


Since by definition the matrix with components M® the so-called minor to 
I Satisfies 


(4.30) 9, M = go? 
we have g2— Mg. 


(*) Apart from exceptional cases, such as polar co-ordinates at the origin. 
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43. — Tensor densities. 


In addition to quantities transforming as tensors we in general encounter 
quantities transforming as 


(4.31) Ty = 


where W is a number and |0x/0x'| the Jacobian. Such quantities as T are 
called tensor densities of weight W. Thus an ordinary tensor is a tensor den- 
sity of weight 0, the most frequently encountered density being of weight 
W=1. To arrive at such a quantity consider the integral 


(4.32) ie dx da’ dx? daz’. 


In special relativity this integral defines an invariant if £ is a scalar, 
since under a Lorentz transformation dx°dv1da? da* > da’? da'tda” da’*. However, 
in general relativity for the integral to be an invariant it is necessary that 
£ be a scalar density of weight one. For, consider the transformation to a 
new co-ordinate system: (*) 


0® | 
(4.33) dx° da! dx? da? = | Di | de da dada =, 


and we must have therefore 


0% 
CHA 


(4.34) Gia 


Let us now find an expression for the invariant volume element. We have, 
since the transformation connecting the metric tensor satisfies 


; 0x1 \ ( dae 
(4.35) Iw = on™) ET 0 » 


l 
yw 


and since g,, is symmetric we may introduce matrix notation to write the 


above as 
: = lat 
où IUT ae Ia 
where | |” is the transpose. 
(*) Use «outer products » to show this: dat A dx? = —da?A dx, del A dat= 0. 
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Now using the theorem that the determinant of a product equals the pro- 
duct of the determinants and the fact that the determinant of the transpose 
matrix equals the determinant of the matrix we have 


dx |? 
(4.37) lidi Ever) oo 
and hence (*) 
à ss 
9 RO] CL 
(4.38) VE = rl 


From the above it follows under à co-ordinate transformation 
(4.39) V_ gdr der de? da? = V— g' dada da’? da’ . 


So that apart from sign di an da dada? da? is an invariant element of volume. 
Because of the + sign, however, it is not a true invariant but a « pseudo- 
invariant ». Another tensor density of importance in four dimensions is the 
Levi-Civita alternating symbol e,,.,, with e = 0 it two subscripts are equal 


apbyò 
and = +1 depending on whether we have an even or odd permutation 
€1934 = 61994 — — C2134 = Eos. It is also convenient to introduce the pseudotensor 
EN provided we deal with proper transformations (for which 


the Jacobian does not change sign), it transforms as a true tensor. Also we 
is i 
have E° = —(— g) Cian 


4°4. — Covariant differentiation; parallel displacement. 


Let us now consider the follow question. Is it possible to introduce deri- 
vatives of tensors in a covariant way? The reason for asking this is clear. 
Our usual laws of physics are formulated in terms of partial differential equa- 
tions and by our general principle of covariance, these equations should be 
expressible in a covariant way. If it were not possible to introduce differen- 
tiation of tensors in a covariant fashion the theory would be inconsistent. 
On the other hand we have seen from our discussion of Maxwell’s equations in 
vector form that they are covariant under arbitrary spatial transformations. 
We therefore surmise that a covariant differentiation formalism should be 
possible. To arrive at such a formalism it is convenient to consider the problem 
of differentiating vectors covariantly. 

In elementary physics we arrive at the concept of the derivative of a vec- 


() We introduce the minus sign under the radical since g is negative. 
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tor V by considering its values at a point x, V(x) and its values at a nearby 
point V(x+dx). We then displace the vector parallel to itself and take the 
difference 


V(x+Ax) 


so that we have 


(4.40) AMEN i pen LAIT 


And in special relativity we arrive at a tensor 7, — 0V;/0xi in this fashion. 
However it is clear that in general 0V,/0x% does not form a tensor for 


rl 


4.41 2 alias | 
| ) Cab Ox ea'v | bab dae ” 


OVE Cle eG Ody hae 0@'Y on! OV, | ©2x'H 
cab 0x8 ox \ Où ” 


We see that the presence of the second term destroys the transformation pro- 
perties of OV, om? Now the interesting thing about this term is that it is 
proportional to Vi, and is multiplied by a quantity having three indices 
<j. The latter is obviously not a tensor since it would vanish under linear 
transformation of the co-ordinates. 

However the appearance of this term suggests that in taking the covariant 
derivative of V, we need to add a term linear in V, which has the effect of 
cancelling this non-tensor quantity. In other words the covariant derivative 
will be linear in V, as we should expect. This additional term represents the 
change in the vector undergoing parallel displacement from the point # to 
x+Ax. We therefore conclude the change in the vector V,, DV,, taken between 
the value at the point #x+Ax and the value of the vector displaced parallel to 
itself from the point x will be of the form 


OV È 
(4.42) DI = da’ — DEV dut. 


Expressed in derivative form DV,/dx is usually designated as V,, , we have 


(4.43) VA ig TAR 
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The quantities 1%, (yet to be defined) are called the Christoffel symbols of 
the second kind. From (4.42) it follows that under parallel displacement in 
the interval dw’ a covariant vector undergoes a change 


(4.44) = + TeV da. 


In analogy with (4.41) we shall assume /%,=— 1%, although spaces in which 
this is not true have been considered in the literature. To find V*, and oh, 
we postulate that under parallel displacement the scalar product of two vec- 
tors V*U, should be unchanged, as is the case in our usage of parallel displa- 
cement in elementary physics. Hence we have (assuming the differentiation 
law of products) 


(4.45) SU. ES VOU VS ea) 
Using (4.44) this can be written 
(4.46) ($V*+ 1%, V dx )U,= 0. 


Since the vector U, is entirely arbitrary, we have 


(4.47) M Te dx’ . 
And it follows 

4.48 Be ove Zs He 
(4.48) ager Bae 3 


Let us now attempt to arrive at an expression for the J” up: We have 
; de 7 7B 
(4.49) EME RA! RE GA re 


But on the other hand if V,., is to transform as a tensor 


(4.50) Von = TONE 
from which it follows 

Chae e 
for all V? and hence, 
(4.51) Io: u =. 0 “ 


From (4.48) we therefore have multiplying by Ia) 


4.52 Tia Re NET yw 47 
( ) UST, = doh dol g J yo ABT ee) V, ; 
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and since V,,, = 0V,/0af — 1%,V,, it follows 


La A 
I Og 
x 1 va MSY œ py 
(4.53) Toe 9 def LUBY ya 


Let us now introduce the Christoffel symbols of the first kind defined by 


(4.54) 12 = ga, yp] . 
Then (4.53) becomes 


Yor 


(4.55) fo w= ae 


061, 


which may be written upon permuting the indices in the following ways, 


Jor 
Le, vB] + [y eb] = = 3) 
ox! 
à OF, 
(4.56) [y Bel + [P, vol = a 
LP, ey] + Le, By] = dj 
da” 


Now using the symmetry [f,ye]=[, ey] and subtracting the 2nd equation 
from the 3rd we have 


0G aly 
(4.57) — fy, Bel + Le, By] = ee — 


Oa” ome 


And now upon adding to the first we find 


E - 1 (09oy 008 — Yay 
(4.58) le, B= (Ge + de) 


vo 


And hence upon multiplication by g 


Lod, Ofop 9 py 
4.59 brief A eee PN 
( ) By 2 9 AaB iz dx? dre 


Finally we may derive the rules for differentiation of higher order tensors 
by treating them as products, for example, write 7°, = 4,B,; t then 
uh 


(4.60) Tg = Apa Bot Abri = APR Tg IL 
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: à De ; 
Using the rules for covariant differentiation one may recast Maxwell’s equa- 
tions in four dimensional covariant form. They become 


(4.61) Be CIRE, TE 


where F4= 4 gF", J¢é=V—gj". It is a curious result that because F* = 
—_ PF" all the above covariant derivatives may be replaced by ordinary 
derivatives. The «dual» to the F’” is defined by Por He show that 
the dual also satisfies the homogeneous Maxwell equations. Note that we 
have FP! = — 15% P* . In differentiating tensor densities the following rule 
is necessary, which can be easily derived using 9,g3= 0, g°?, = 0 and the 
definition of the determinant, è.e., g.,= 0. Also one has 1%, =39"* 0g,,| ox? — 
=d0lIn\/—g/ev’. Using this we find for a symmetric tensor 


12 vey ger 
(4.62) T= 


Je qe. 
ohi 


<a 
Ss 

DI 

8 

bo 

[SD © 


45. — The equations of motion of a particle; geodesic paths. 


The concepts of parallel displacement of a vector and covariant differen- 
tiation may be used to formulate the equations of geodesic motion in a very 
succint way. In classical mechanics we have the well-known variational prin- 
ciple 

te 


(4.63) [za =0, 


ty 


where L is the Lagrangian and is generally of the form L(q, è), where q, q are 
the generalized co-ordinates and velocities, and #, and #, are the points between 
which the variation is taken. 


In general relativity we adopt a similar variational principle which reduces 
to the above in the non-relativistic limit, 


Ps Po 
(4.64) 3 | ds = 5| V9. dd — 0. 


Pi Pi 


The mathematical interpretation of this being that we are looking for those 
paths between the space-time points P, and P, for which the four dimen- 


wo 
0 
oD 
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sional distance is an extremum and the integral stationary. Such paths are 
called geodesic paths. In the case of special relativity the I,» May be taken 
to be constants and in this case the path becomes simply an Euclidean straight 
line. However in general this is not the case. A simple physical interpre- 
tation of the variational principle is to note that since ds represents the proper 
interval of a clock, we are simply comparing the readings of clocks moved 
between P, and P,. From special relativity it is clear that if we move a 
clock C, uniformly between two space points A and B, and if we move another 
clock ©, along a winding path between A and B, and if moreover we demand 
that OC, left A and arrived at B simultaneously with C,, then it follows, C, 


will read less elapsed time than C,. Thus in special relativity the geodesic 
Pa 
path will make | ds a maximum compared with neighboring paths, although 
Pi 
more generally all we can say is that it will be an extremum. 
Let us now introduce some parameter 4 so that the paths are describable 


as #=%"(7) and we may write (4.64) as 


Poko) Path, 
/, det do 
asa 0e 
(4.65) sf] Iw ay ay 
Pi(A1) 


In this form we now have a typical variational principle with fixed end points. 
Therefore designating the integrand by L(q, 4) where ¢=dza"/dA we arrive at 
the Euler-Lagrange equations: 


d ob OL 


“a di che dee 0° 


The expressions 0L/0x, 0L/0° may be written 


OL 11 dg, da" da’ 


67) TORO 
È OL LUNA 1 he il fie 
(4.68) die = 9 L?e oY + op Yue” = n Gov - 
and hence 
d OL Dale. = dg da 
3 RES ne DURS ep log E 
(4.69) di die T2 da Ie + T ane + de 


Now let us choose J =ds/dA=1. This means we are taking as our para- 
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meter the element of the distance itself. This can be done in general except 
for a light ray along which ds=0; for this case one has to resort to a 


limiting process. Then we have with L=1, 


dx’ dg, dat da” 1 095 dx’ de 


Sr GER Pre a 
(ET Je ds? ' da ds ds 2 ove ds ds 


== (ie 


By interchanging the « dummy indices » in the second term we may write it as 


i à LU av a Lu wv 
OF ov da 2) da” Ji OJov dat da ; Cou da da 
PA) = i. aan = 
dal ds ds DOTE ds ds Gar? OS ols 


and hence finally the above can be written 


d?% dat’ da” 


TE -_ 4 eee) 
Gil) Jor as: ione ds ds 


Now multiplying by g° and using gg, = 0), g*[o, uv] = 1%, we have finally 


da x dx da” 
(4.73) 


oe = 0 
= T 1 . 
ds? SAS 


The equations are the generalization in the presence of a gravitational field 
of the special relativistic equations 


d2x* 
ds? 


The term /”,(dx“/ds)(dx”/ds) represents the effect of the gravitational field 
although in flat space the 71%, will not in general vanish for arbitrary choices 
of the co-ordinates. Because the 7%, are not the components of a tensor and 
depend only on the first derivatives of the YJ,» ib is always possible to choose 
a co-ordinate system such that locally the 1 vanish. Having done this the 
equations of motion reduce to those of special relativity. To arrive at this 
result we note that it can be shown by direct calculation that the transformation 
properties of the /" are 


uw 


0w'* da” Om _ 6 da!“ 0202 


4.75 (ae | 
( 75) 1, darf da! an" ys 71 


da? Calhoun” 
Now let us choose the transformation to be of the form 


(4.76) a = aw’? a8 tae alg!” 


‘uy 
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in the neighborhood of the point #?—0. Then upon performing the diffe- 
rentiation we have 


Al Gr a7. Mes = (22 
(4.77) Io = Lo 
and hence choosing dio = La the result follows. The system of co-ordinates 


chosen in this way is called a geodesic system or geodesic frame. In such a 
frame all the derivatives 0g, [ow vanish locally. There are forty such deriva- 
tives and similarly 40 Christoffel symbols and the two sets of quantities are 
linearly dependent. Thus we have 


Fe Fw 
0= Yuva = ia Ji; uvIav var Di due 0 
(4.78) 
C9 uo PAS | B 
ant = AY xv aie vAG up . 


Hence the vanishing of the /”s implies the vanishing of the ©g,,/0x% and con- 
versely. In such a frame the gradients of the gravitational potentials vanish 
and consequently the «intensities » or «field strengths» of the gravitational 
field. Finally, we note a more general theorem due to FERMI (1922), namely, 
we may choose co-ordinates such that they vanish along a geodesic path, %.e., 
not merely at a given point. Einsteins discussions of observations in a freely- 
falling elevator, together with many other discussions preceding Fermi’s work, 
appear to implicitly assume this theorem. It may therefore be regarded as 
a logical « check » of the theory that it implicitly « predicted » the existence 
of such a theorem. 


4°6. — Riemann curvature tensor. 


We have now seen how it is possible to develop a formalism to handle the 
problem of covariant differentiation in spaces for which the g,, are not con- 
stants, treating the time co-ordinate on the same footing as the spatial co- 
ordinates. In the case of spaces in which it is possible to introduce Cartesian 
co-ordinates the covariant derivative reduces to the ordinary derivative. How- 
ever, for some spaces it is not possible to do this: for example, the surface of 
a sphere forms a two dimensional « curved space ». It is not possible to find 
a co-ordinate system for which the element of distance takes the simple form 
do? = dw?-+ dy? and the g;; (i, j =1, 2) all become constants. For the sphere 
this is essentially intuitively obvious, however for more complicated spaces 
we clearly need some mathematical formalism for determining whether or not 
a space is curved. The physical application will be to associate certain types 
of curved spaces with the presence of permanent gravitational fields such as 
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those produced by gravitating bodies as the earth, the sun, and in fact any 
distribution of energy (by the principle of equivalence). When there are no 
such bodies present, at least locally, we may imagine that their effect on the 
curvature becomes negligible, and hence by a suitable co-ordinate transfor- 
mation the space may be made flat (*). 

We therefore proceed to look for a tensor that will characterize the cur- 
vature of space. The reason we look for a tensor is clear: if we have a tensor 
and it vanishes in one co-ordinate system it vanishes in all co-ordinate systems, 
so that our characterization of whether or not a space is flat will not depend 
on the particular system of co-ordinates chosen. Conversely if the space is 
not flat and hence the tensor is not zero, then it will not vanish in any other 
co-ordinate system we may adopt in the space. 

The above properties of a tensor are very fundamental and may be proved 
simply as follows; we have for an arbitrary tensor 7% 


n pl... 0æ'#\ ( dae Le. 
di ge (24 (22). re 


Then if 12 vanishes, clearly Ta also vanishes. On the other hand if da does 
not vanish, neither can the ee vanish. For we have also 


D À pv 
(4.80) Th = (= es Wap ao 


an!) \ axe 


and if 7" vanished so would pe: , contrary to our assumption. 

To find such a tensor it is convenient to consider what happens to a vector 
when we transport it parallel to itself around a closed circuit. We know in 
flat space when we transport a vector parallel to itself around such a circuit 
the vector is unchanged; indeed this is one of the basic assumptions of Euclidean 
geometry and consequently of elementary mechanics as well (the parallelogram 
of forces). 

However consider the surface of a sphere and consider a unit vector dæ’/do 
tangent to a geodesic on the sphere (an are of a great circle), then on parallel 
displacement, by assumption a vector V’ will maintain a constant angle with 


(*) No attempt will be made here to recount the exciting history of the develop- 
ment of curved spaces and non-Euclidean geometry. The most important contributions 
are associated with the names of SACCHERI, LEGENDRE, Gauss, LOBATCHEVSKY, BOLYAI, 
RIEMANN and BELTRAMI (who proved the consistency of Euclidean geometry implies 
the consistency of non-Euclidean geometry of the hyperbolic type). For a history of 
this development see, for example R. BONOLA: Non-Euclidean Geometry. 
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respect to dx'/do. As can be easily seen from the figure upon displacing Vi 
parallel to itself around the circuit A B C A it will, upon returning to A, no 
longer point in the original direction. 
Clearly had we chosen another circuit A 
we would in general find a different 
discrepancy. In other words, in general, 
in a curved space if we transport a vec- 
tor parallel to itself along two different 
paths from a point P to a point P’ the 
two vectors will in general have dif- 
ferent components at P’. 

We now proceed to utilize this idea 
to construct the curvature tensor. We V V 
consider the infinitesimal parallelogram Fig. 3. 
of sides 54, Ax° and a vector V# (*). 
We transport V” parallel to itself from the point (1) to the point (3) first along 

the path 123, and then along the path 143. The diffe- 


V after parallel 
displacement 


tangent 
vector 


4 3 rence in the two vectors will then be a measure of the 
curvature of the space. We therefore have upon tran- 
Axe sporting the vector V*“(1) parallel to itself to the 
| : point (2), since $V“1 — 2) ——7%(1) da*V"(1) 
5x4 
Fig. 4. (4.81) Ve(2) = Vel) — T4 (1) 5x V"(1) . 


Upon transporting it to (3), we must take into account the change undergone 
in /%,; we have 

3) Tule € a | Tu (da a ) A Tye / 
(4.82) OVH(2 > 3) = — Lex) + - in» outs V'(2)Aze, 
and substituting for V’(2) and neglecting terms of higher order than Sat Ax? 
we have 


(4.83) SV“(2 +3) = — I%V"(1) Ag? + P81) Pl) V°(1) Ant 5x — 
re 
— et (1) V"(1) da*Aa?. 


ox 


and the value of the vector at the point (3) is 


(4.84) Ves y= Vel) TO TL) + 842-8). 


(123) 


(*) Strictly speaking the « parallelogram » does not close up unless we modify 
either 5x7, or Ax? or both; however, the effects we are neglecting are of third order 
in the infinitesimals, whereas the change in V“ for which we are looking is second 
order. See, e.g., E. CARTAN: Legons sur la géométrie des espaces de Riemann (Paris, 1951). 


co 
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On the other hand if we transport the vector along the path (143) we have 


(4.85) See) = Ved) — 74) Awe V"(1) + 8048 > 4), 


ov 
where 3V"“(3 — 4) is given by 


(4.86) -SV*(3->4)=—IhV (1) de PEO Py) V%(1) Ada — 


2 pe 
_ atl”) V”(1) Sa" Ax? = 

Car 
Upon taking the difference bles V"(3) 


Lu 
(143) ( 


dae ou? 


OT _ ao pe pe à jé pr \vedae Sat 
= 3 Avi px T ov Ax XL OL. 


Note that the quantity "(3)— (3) is also the change dV“ upon taking V“ 
around the closed circuit, since we have W(3) = V"(1) +84, (3) =V"Q)+ 
aa STE hs be u ative 
+3,¥", and hence, taking the difference à J}— 0 Y= di. The negative of 
the quantity in parentheses is called the Riemann curvature tensor and is 
denoted by À, (although other notations are frequently used in the litera- 


ture). We may therefore rewrite (4.87) as 


(4.88) S VE =— Re 


(1234) a ho 


V* Aa? dat. 


I 


Upon observing that R",,,=— R",,, we may further rewrite the expression as 


{ (LASSO Ape x Ao 
(4.89) Ski = Ra AO, 
where Ao”? = Aa Ja? — Aw? da and represents the infinitesimal element of area. 
Since V*, Ao” and 3 # are tensors, it is clear that R4,, , must also be a tensor. 
The reason that à de transforms as a tensor is due to the fact that it represents 
the comparison of two vectors (the original vector and the vector transported 
around the closed path) at a point and this clearly defines another vector at 
the point. 

Another way to see that R',,, isa tensor is to directly compute V“,,,—V" 9.45 
one finds 


(4.90) PAPER RESI 
from which the tensorial character of R* ,, is clear. 

Thus the non-vanishing of the Riemann curvature tensor introduces a non- 
commutativity of covariant derivatives. The vanishing of Ris clearly a 
necessary condition that the space be flat, it may also be shown that it is a 
sufficient condition. 
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47. — Symmetry properties of the Riemann tensor and Bianchi identities. 
Another expression for the Riemann tensor is given by 
(4.91) Bring = Gin te, 


vo" 


Upon introducing the Christoffel symbols of the first kind and denoting or- 
dinary differentiation by a comma, the above expression may be written 


(492) Ru = (A, no], — (A, rl, + 9x, Jollee, uo] — [x, Ile, mel) - 


The first two terms may be rewritten, 


(4.93) [A, Lo}, at [A, uv], =a 2 (Jour Tuv,3o av. uo din) ? 
From these expressions it may be verified that R,,,, has the following sym- 
metry properties: 
[ LOC, = fr Ra uov 
(4.94) Ramo = + Riu 
| Raro gra 2e 
(4.95) Riot Ero + Ei = 0 (cyclic identities) . 


In addition to the above identities we have also some identities involving 
the first derivatives of the £,,,,, the « Bianchi identities », which state 


(4.96) PRayvost sii Po, nots =F Layevo == 0% 


These identities may be readily verified upon adopting geodesic co-ordinates 
at a given point so that the non-linear terms in (4.92) vanish and using (4.93). 

As a consequence of the identities (4.94), (4.95) the number of independent 
components N, of the Riemann tensor is considerably reduced from the n4 
(where n — 0}, the dimensionality of the space) terms one would otherwise 
have. To arrive at an expression for N, in terms of n we note that there are 
three cases to consider: 1) case where two subscripts are repeated, e.g., terms 
like R42; 2) case where one subscript is repeated, ¢.g., Ris; 3) case where 
no subscripts are repeated. In case 1), we have $n(m— 1) independent terms 
from (4.94), moreover (4.95) yields no reduction, since terms like LR, upon 
applying the cyclic identities yield Paya + RASE oo ZA Taxon 
(4.94) and hence the identity merely reproduces what we have already used 
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Reo = — Raw. (We have used latin letters above to indicate no summation 

fae CAO OOF A x E 

is intended). In case 2) we have In(n—1)(n—2) possibilities, also from 
veli 1 11168 re , id Tr. hat 

ety? Waa Rage 0, thencyelie identities do not reduce han numbe 

case 3), neglecting the cyclic identities, we have $-4n(n — 1)3(n — 2)(n — 3) 

possibilities, the extra factor } arising because of the symmetry R,,,0= Ron 

Because of the cyclic identities this number is reduced by the factor 3. Hence 


we have in total 


It is interesting to tabulate for a range of n, the values of N,, and also the 
number of components of the g,,=3#(n+1) and also of the Christoffel sym- 
bols, $n2(n+1), we have 


n tn(n41) $n? (n+1) IN 


bo 
DI 
© 
i 


3 6 13 6 
(4.98) + 10 40 20 
5 15 75 50 
6 21 126 105 
7 28 196 196 


0 0) 
w 
D 
bo 
Q0 
(0.0) 
C9 
C9 
a 


We see, for example, that the two-dimensional manifold is particularly simple 
since its curvature tensor has only one independent component, which we 
make take to be 39. 


4°8. — Contracted curvature tensor; the Ricci tensor. 


We may contract the Riemann tensor to form a tensor of rank two, the 
Ricci tensor, defined as 


(4.99) Roig 


uv Apvo © 


The tensor R,, is symmetric since using (4.94) 


À 
(4.100) By = JR = JR = GF Bova 


396 


al 


AN INTRODUCTION TO THE GENERAL THEORY OF RELATIVITY 47 
we may further contract to form the scalar curvature R 


(4.101) Ti hie yh 


Apro * 


If we now consider the case of n= 2, we find for À the following expression 
(4.102) RARA IR 
The quantity £,,13/9 is called the Gaussian curvature K of the space 


4.103) Koa : 


If we calculate K for the surface of a sphere with the line element given by 


dr? 


(4.104) Ch Bee Tes CUS SR NES r/R? 


+ r° dy?. r = Rsing. 


we find K= R-?. Similarly for the pseudo-sphere obtained by setting À ik, 
6 +—i6', we find K=— R-=. (Calculate the Gaussian curvature for a two-di- 
mensional surface associated with the spatial line-element of Chapter IT (eq. 
(2.11)) in the « plane» de — 0.) 

The geometrical significance of the Ricci tensor is obtained by calculating 
the sum of the Gaussian curvatures for the set of orthogonal planes spanned 
by geodesics through a given point in the n-dimensional Riemannian mani- 
fold (alternatively referred to as « V,»). Denoting the curvature associated 
with a given plane by K,,, one finds (*) 


(4.10) > Kup Dix Es ia ? 
B 


where Ha is a unit vector. The sum of these mean curvatures in turn defines 
the negative of the scalar curvature, using D Ai), = 9. 


x 


(*) See, for example, C. E. WEATHERBURN: An Introduction to Riemannian Geometry 
and the Tensor Calculus (Cambridge, 1938); L. P. EISENHART: Riemannian Geometry 
(Princeton, 1926); T. Levi-Civira: The Absolute Differential Calculus (London, 1927). 
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CHAPTER V. 


THE FIELD EQUATIONS AND SCHWARZSCHILD’S SOLUTION; 
EQUATIONS OF MOTION IN THE SCHWARZSCHILD FIELD 


5:1. — The Einstein tensor. 


In the preceding chapter we arrived at the Riemann curvature tensor Rae 
which in a V, has 20 independent components. One might now consider con- 
structing a theory of gravitation based directly on The However, as we 
intend to associate the presence of a permanent gravitational field with material 
sources (just as we associate an electromagnetic field with charged sources, it 
would be necessary to relate I i to a tensor of the same rank representing a 
description of matter, and we do not have a 20 component tensor providing 
such a description. Moreover we have only 10 field quantities, the g,,, and 
so we would also expect that actually we need only 10 equations. Further, 
outside a material source, we intend to assert that the space-time manifold 
is curved, and clearly if our equations were of the form 
(5.1) Ro Mia 
where M ae. described the matter-source, then we should be forced to conclude 
that outside matter, space would be flat. The geodesic paths would then be 
straight lines and we would no longer have a possible geometrical explanation 
for the phenomena associated with permanent gravitational fields—in contra- 
diction with our physical arguments based on the principle of equivalence. 
We therefore rule out any attempt to construct field equations on the basis 
Gi (HIDE 

Thus we are left with 2,,, R as the simplest possibilities. A theory based on 


Si 
ho 


logo 


where T=T (= trace of the energy-momentum tensor) is also ruled-out on 
the grounds that it does not give us enough equations to determine the ge 
We note that because of the possibility of performing four co-ordinate trans- 
formations we can in general make four of the oe constant, thus leaving us 
with six functions to determine. From (5.2) we could eliminate one, but the 
system is hopelessly undetermined. Thus a theory based on (5.2) while more 
in line with what we expect, is still not satisfactory. 
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We are therefore forced by these arguments to attempt to construct a 
theory based on R,, and T,, the symmetric energy-momentum tensor. A 
natural starting point, and indeed Einstein’ original starting point was to 


assume 


uv? 


(5.3) TELA 
However, this led to the following difficulty: We know from special relativity 
that the 7°,, satisfy certain conservation laws which in covariant notation are 
T”,,—=0; on the other hand, one may readily show R#,#0, unless R=0. 
For a while EINSTEIN thought that (5.3) was nevertheless correct and since (5.3) 
implies Ro 7, that 7 must vanish for matter just as it does for the electro- 
magnetic field. However this led to serious difficulties; moreover in analogy 
with Maxwell’s equations it would be more satisfactory if one could obtain 
TV, = 0 because of an identity in the field equations. In this way one would 
also avoid having to make specializing asumptions about the 77, 

The fact that we need a differential identity suggests abano se Bianchi 
identities. Let us therefore contract the Bianchi identities 


Leon LO cin gp ti pp 
with respect to the indices « and 6. We have 
(5.4) BR, + Rare = 0. 
Upon raising the index f, we have 


(5.5) Ri Ri, Rd 


VY 
Now contracting with respect to 6 and » this becomes 
(5.6) 2R°,5—R,,= 0 
which may be alternatively written on using dPR. s=k,, 
(5.7) (R? —36°R),,= OF 
The tensor defined by 


(5.8) GY = R¥—16"R 


has the properties of the tensor for which we were looking. Lowering the 
index u we have alternatively @,,=R,,—39,,. The tensor G,, is called the 


pv 


a 
% 4 - Supplemento al Nuovo Cimento. 
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Einstein tensor. Unfortunately the tensor G,, is not unique; we could have 
added a tensor with vanishing divergence, in particular the cosmological term 
Ag, since g,y;a = 0. The problem of whether this term does or does not 
belong in the equations has never been settled, the prevailing opinion being 
that it does not belong in the equations—in any case the « cosmological con- 
stant » À must be small, of order 10-5# cm-?, and would not lead to any meas- 
urable effects on planetary dynamics. We shall therefore disregard it in this 
chapter. 

Before proceeding to construct the field equations let us now obtain explicit 
expressions for the Ricci tensor. We find upon using the identity 7;,=0, nvV—g, 


elnv—g dI 


énv—g 
Gal Oa” ou ; 


Ou 


(5.9) = lello 


In a geodesic system of co-ordinates we can always make the 7%, vanish at 
a point so that at this point (or along a geodesic) the Ricci tensor reduces to 


elnVv—g dî 


(5.10) y PE ao 


Another useful form is to introduce « proper co-ordinates », for which by de- 
finition V—g9=1, one then has 


Gere 


Cx 


(5.11) Lin = + Diy IG, - 


It is perhaps interesting, historically, to remark that all of Hinstein’s 1915 
papers were based on using « proper co-ordinates », even through 1918 proper 
co-ordinates were used extensively. One of the simplifying aspects of proper 
co-ordinates is that many expressions are the same as those in special rela- 
tivity, for example, the Maxwell divergence equations d,(V—g F”) = V—g j" 
are the same as in special relativity; also 4/—g d#æ — dx; etc. Thus a great 
simplification is achieved. On the other hand it turns out that when it comes 
to solving the field equations, it is usually simpler to introduce co-ordinates 
that are not proper, but related to proper co-ordinates by an Euclidean map, 
i.e., in the same way that polar co-ordinates are related to rectangular co-ordi- 
nates. For some solutions even this device proves inconvenient. Some authors 
(notably Fock) prefer «harmonic co-ordinates » for which 0,(V—g g“) — 02 
Note that in contrast, proper co-ordinates satisfy 0,(V—g ò,)=0 and are 
therefore less restrictive. 

Needless to say one would be led to logical inconsistencies if one tried to 
argue for a particular choice of co-ordinate conditions to be applied indepen- 
dently of the symmetry of the problem. For example, consider the classical 


=) 
D 
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equation in flat space V-V=0, where V is some vector. In covariant nota- 
tion this can be written (0/0x,)(Vy y'4V;) — 0, where y;; is defined of course by 
do? = y,;da‘ dei. The « harmonic co-ordinate conditions » for this problem would. 
demand 0,(Vyy%)=0. However if we now suppose the problem has spherical 
symmetry, it is clearly appropriate to introduce spherical polar co-ordinates 
which do not satisfy either harmonic or proper co-ordinate conditions. Clearly 
the difficulty lies with the co-ordinate conditions rather than with the choice 
of co-ordinates. 


5'2. — Einstein’s field equations. 


Let us now attempt to summarize the preceding ideas of general relativity 
in a series of postulates which will suggest in a more formal way how we should 
go about constructing the field equations. No attempt will be made in this 
summary to state the postulates in a mathematically rigorous or concise way. 


Postulate 1. The laws of nature can be expressed in a generally covariant 
way, treating time as a co-ordinate along with the three spatial co-ordinates. 


Postulate 2. Space and time form a four-dimensional manifold for which 
there is an element of distance (in analogy with Riemannian geometry) 


ds? = g,, da" da” (i590 == 071270) 


with the interpretation given previously of ds’. In particular, we can always 
choose co-ordinates so that locally, ds? = dT?— dX*— d Y?— dZ?, that is, spe- 
cial relativity holds locally in a sufficiently small region of the space-time 
manifold just as in Riemannian geometry, with g,, positive definite, Euclidean 
geometry holds locally. 


Postulate 3. The element of distance also determines the dynamics of a 
particle (generalizing from special relativity) according to the variational prin- 


> fm as = OF 


together with ds — 0, for a light ray. Since m plays no role in this variational 
principle we have here a partial expression of the principle of equivalence. 
(Remark: It has been shown by later work that the equations of motion of 
particles follow from the field equations so that this postulate is not necessary.) 


ciple: 


Postulate 4. The principle of equivalence: Just as in an accelerated co- 
ordinate system (e.g. the rotating co-ordinate system previously considered) 
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Cr 


there are «inertial forces » arising as a consequence of the fact that the g,, 
are different from the 7, of special relativity, so too in the presence of per- 
manent gravitational fields the g,, will differ from the 7,,. 


Postulate 5. Mach’s principle: As a consequence of the relativity of mo- 
tion the inertial behavior of a body should be determined by the presence 
of other bodies in the universe. Since the dynamics of a body follows from 
Postulate 3, it follows that the metrical properties of the space-time manifold 
should be determined by the distribution of energy and momentum in the 


universe. 


Postulate 6. Relativistic correspondence principle: In the non-relativistic 
limit, Newtonian mechanics and gravitational theory hold. 


To construct the field equations using these postulates we proceed as follows: 
From Postulate 1, we know that the field equations will involve some tensors 
of various ranks and will be of the form 


LU ae [LV see 
Al 


We also know that since we are to determine the g,, from the equations (postu- 
late 4) the equations should at least involve ten quantities corresponding 
to the ten g,,. However since by a co-ordinate transformation we can always 
eliminate in general four of the g,, (more conveniently we can make go = 1, 
Joi 0), these equations should satisfy four identities so that they are under- 
determined with respect to the g,, and permit the freedom of four arbitrary 
co-ordinate transformations. Using Postulate 5 we should expect to see the 
energy-momentum tensor 7’, of matter play the role of being a source of the 
field. And finally from Postulate 6 we should expect that in the non-relati- 
vistic limit the field equations should reduce to Poisson’s equation for the 
potential 


(5.12) Vi U = dont = = axe, 


where o is the matter density. 
A system of equations having this property is the following 


(5.13) leo = — xT 


pv) 


where x =8zG/c!. We see that it satisfies four identities (from the previous 
section) 


(5.14) GY TH ee 0, 


sv 
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The fact that 7.,—0 follows from the conservation law of energy and mo- 
mentum in special relativity, 


(5.15) es Be) 


This equation is not of course generally covariant but its covariant analog is 


A = au NEMI e 
(5.16) VEUT, = a pr pat 


In the case of special relativity V—g9=1, 09,,/004=0, and this reduces to 
the above. To show that (5.13) reduces to Poisson’s equation it is convenient 
to linearize the equations setting 


(5.17) Grip = Mag hs; 


with %,, regarded as small. 
Proceeding in this way and adopting suitable co-ordinate conditions one 
can in fact show (BERGMANN, p. 182-185; MoLLER, p. 313-315) that 


For the case of a mass point o is a ò-funetion and outside this point we have 
Laplace’s equation V?h4, = 0. It follows from (5.18) that to lowest order the 
metric is given by (introducing suitable units) 


2G M 
(5.19) dei (1 — ni edt 2— dr? — r° sin? 0 dy? — r° dé? , 


as we obtained previously using the principle of equivalence. 

However, we should like to emphasize that these considerations were based 
on using a « weak-field » approximation and a suitable choice of co-ordi- 
nates. The exact structure of the field equations indicates one can obtain 7, 
directly from g” (no summation) via a first order linear differential equation 
in the co-ordinate system used below, a Laplacian operator on go showing up 
in the equation for the transverse stress. Thus some caution is necessary in 
drawing inference from the weak-field perturbation method. 


5:3. — The Schwarzschild field. 


Let us now obtain the solution to the field equations for a point mass, that 


is we look for solutions to 


(5.20) ht 39 yy À =. 
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This being the generalization of Laplace’s equation in general relativity. Note 
that since g”k,,=R the above equations also imply R=0, R,,=0. It 
should be pointed out that although R,,—0, it does not follow L° 18 = 0 
corresponding to flat space. The equations R,,—0 represent ten equations, 
whereas the equations R°,,,— 0 represent twenty equations. Only in the case 
of 2-dimensional space-time or 3-dimensional space-time would the equations 
R,,—=0 imply space-time is flat. Thus the field equations make no sense un- 
less space-time has at least 3+1 dimensions (*). 

To simplify the solution to the field equations we demand that the g,, be 
independent of time, and the line element spherically symmetrical. One can 
show that under these circumstances it can be brought into the form 


(5.21) ds? = e dt? — et dr? — r?. dO? — r? sin? 6 dq?, 


where v= y(r), A=A(r). Such a line element is described as a static line ele- 
ment with spherical symmetry. Needless to say any transformation of the 
form r= f(r*), 6* =6@ do not destroy the spherical symmetry. To solve the 
equations we now compute the Christoffel symbols corresponding to this line 
element and substitute in the field equations. It is most convenient to use the 
field equations in mixed form R’—464R=0. We have, in general, after 
computing the Christoffel symbols and substituting in the field equations (**) 


PR, 
OU 
bo 
bo 

| 

© 

DA 
ES 
= 

| 

No 
| 

eg DS 
Sa 
= 
SRE 
| 

x 

Mo 

i) 
| 

à 

| 

wow 


and we obtain the solution to the desired case upon setting == Le 
=T,=0. Subtracting the third from the first equation we have 


(5.23) exp tai (= fe = 26 
ti 1h 


and y=—/-+-a constant. The third equation can be solved directly for 4; 


() Whether the field equations and equations of motion derived from the geodesic 
postulate make sense for 0 > 4 is of course an important question, since if they did 
not, we should have a basis in general relativity for answering the frequently asked 
question, «why is d,=4? ». | 

(*) As done, for example, in Tolman’s book. 
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© 
Qt 


we have, putting v= e, v=—e-*7' and substituting 
v 1 

(5.24) DL — —— ; 
Ter 


the solution to which is v=14+ (4/r). Now since we have e” = ke — 
=k(1+(A/r)) and since we know from the principle of equivalence 
ds? =e’ dt? & (1— (2@M/rc*)) dt’, it follows we should choose k=1, A= 
=—2GM/re? and ds? becomes 


10.25) ds? = (1 == 26M e dt? a7 dr? 
(1 — 24M /rc?) 


— 7? d0? — r? sin? 0 dg? . 


DI 
ya 


The solution with I,» ®8 above (apart from a co-ordinate transformation is 
called «Schwarzschild’s exterior solution ». He also obtained a solution for 
the region inside a spherical distribution of matter, called the «interior solu- 
tion », which we shall not discuss here. 

The above discussion represents the customary approach to solving the 
field equations; it is clearly very formal and does not give much physical in- 
sight into the problem. Since these notes were first written we have been led 
to proceed in a slightly different manner (Phys. Rev. Lett., 6, 147 (1961)). 
The first thing to notice is that if we replace — e” by g”™ (i.e. g”), the equation 
for 7° is linear and first order in g!!! We note also that since the equations indi- 
cate together with the boundary conditions that goog. =—1, we may eliminate 
these two quantities in favor of the scalar potential defined by — g!!= Goo = 
=1+2U. We then find outside matter, or more generally if 79= Ti, the 
two dependent equations (*) 


V0 = — #T?, (= — xT), 


where V2? is the flat-space radial Laplacian. Thus for the only non-trivial 
solution that has ever been checked experimentally, the Schwarzschild field, one 
of the field equations of general relativity is identical with that of the Newto- 
nian potential theory V?U— 0, and the other equation an integral of it—since 
(d/dr) (r°G°) — 2rG3 = 0. We plan to discuss this curious result in greater detail 
in a subsequent publication. Needless to say, this simple linear structure is 
not obtained if we adopt, e.g. isotropic radial co-ordinates, that is, co-ordinates 


() We use the labels 1, 2, 3 to designate the co-ordinates r, 0, @p. 
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for which the spatial line element is 


do? = f(7)[d7? + 72 (a6? + sin? 9 d?)] , 


or if we use proper radial distance, 7.¢. @ = {V =gydr. 

The co-ordinate «r » that we are using is a proper transverse measure. 
It is proportional to the perimeter of a spherical surface surrounding the 
Schwarzschild singularity when measured by infinitesimal rods tangent to the 
surface along a great circle. A rod oriented in this fashion preserves its length 
unchanged relative to the system of co-ordinates, and a fortiori relative to an 
observer at infinity, in contrast with a rod oriented radially. Thus in intro- 
ducing «7» we have introduced an invariant transverse measure of radial dis- 
tance: a spherical shell is said to be closer or further from the singularity rela- 
tive to another spherical shell of perimeter 277, depending on whether r< 7) 
or r >". From our work in the previous section, it is clear r= K-*, where 
K is the Gaussian curvature of the spherical surface. Since K is the product 
of the principal curvatures (which are the same for a sphere) we have therefore 
labelled the spheres according to their radius of curvature. 


5'4. — Orbital equations in a Schwarzschild field. 
Having obtained the expression for the line element of the Schwarzschild 


solution, we are now in a position to solve the equations of motion obtained 
from the variational principle 5 | ds=0. The general equations are 


We may now compute the Christoffel symbols corresponding to the line element 
and hence obtain explicit expressions in the above. However it is possible to 
solve this problem without ever computing Christoffel symbols using the first 
integrals of the equations of motion. 

First let us assume the motion takes place in the plane 0=/2 so that 
the line element is simply (dropping the c) 


2GM 
r 


(5.27) ds? = (1 | dé? — giare — r2 dq? . 


Now in the variational principle, since ds is invariant under yY >¢y + const, { >t+ 
+ const, it follows that there will be two conserved quantities, corresponding 
to energy and angular momentum. 
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=] 


Or 


Alternatively, in the equations below, we have 2L/0t = 0L/Co = 0, 


datano 0 d OL of 


(5.28) per mx e , ° 
ds of ct ds op op 


Hence 0L/¢ci = K,, 0L/0@ = K >: Performing the differentiations we have 


DEN LEG 
r Jds 4 


(1 a dt 


dg 
PURE 76 
rv 4 Ho 


The first integral is effectively the energy conservation law, and the second, 
Kepler’s law. To see the former, expand ds then we obtain 


20M\} \ 
(5.30) ds = (1 E “i di [ È Gi Dr. (a) | ‘af i 
r >| a) 2° \at 


and substituting in (5.29) and again keeping the lowest order terms we have 


(5.31) 143 (7) +5" (2) ERP See 


The first term is nothing but the rest energy of the particle me? if we multiply 
through by m and introduce suitable units. Thus Æ,— 1 plays the role of the 
term H/m we introduced in an earlier discussion. Let us now rewrite the ex- 
pression for the line element in the following form 


a 2G M\ / dt\? a ff Ole eS dp \? 
(5.32) Ib (1 ae Ty E Joo (7) 7 (1 ; 


and simplify using (5.29). Then we have using also dr/ds = (dr/dg)(dg/ds) 


n SIAE 
(5.33) 1= Kidc0— Joo (| a pl 
which may also be written 
# 26 M 5 (ERIC | 2G M 
(5.34) (1 Tri | = LEG os (| 7 + pro 1— is . 


Now introduce u—vr-! and the above becomes 


2 
(5.35) 1—2GMu = Ki — (=) Ki, — Kw + Ky: 26 Mu. 
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Taking the derivative we obtain 

(5.36) 
The non-linear term on the right is the correction to the usual Newtonian equa- 


tions of the orbit represented by the first three terms. Thus setting K He 
G — GJe? the equation can be written as 


du GM 3GM 
ene cater are à 
à : 


(5.37) 


dg? | 
and hence in the limit ¢ — co the Newtonian equation results in accordance 
with the relativistic correspondence principle. 

Let us now look for a perturbative solution to (5.37) treating the last term 
as small. We may imagine the orbit is approximately circular and hence to 
a first approximation w= const = GM FEES Hence we look for a solution of 
the form 
ae GM 
(5.38) “= RE +. Uy , 


(2) 


and (5.37) becomes, neglecting higher order terms, 


2 d2u 6G? M2 
(5.39) de (i eK? ju AD: 


Since in the Newtonian case the solution would be 


(5.40) u = GM/K? + (constant) cos p 


gy 


corresponding to an ellipse, we see that the lowest order effect is simply to 
modify the Newtonian solution to a precessing ellipse, that is, 


È GM / 6G? M? 
5.41 = — > Sté $ Nr Re 
(5.41) “ Ke + (constant) cos | 1 OK" p. 


After g has gone through 27 it must still go through an additional amount 
for w to return to its original value at y= 0; we have, expanding the radical 


ee Be | (Ox aaa 
= eK? Se AT / == 29 , 
(5.42) i CAI 
A = 2 3 120 ra 
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TABLE II. — Theoretical values of the advance of the perihelia per century 
(e is the eccentricity of the ellipse). 


| Planet a! | ew’ 
| i | 

Mercury 437.03 | 8".847 
| Venus 8 .63 | 0 .059 
| Earth 3 .84 0 .064 
| Mars 105 0 .126 
| Jupiter 0 .06 | 0 .003 


TaBLe III. — Contributions to the motion of the perihelia of Mercury and the earth. 


Cause Motion of perihelion 
Tig (6) Mercury Earth 

Mereury 6000000 +1000000 | 6".02540".00 | —13".75+2”.3 | 
Venus 408000 + 1000 | 277.856+0.68| 345 .49+0 .8 
Earth 320390 + 300 | 90 .038-40 .08 | re 
Mars 3088000 + 3 000 2.536 +0 .00 97 .69 +0 .1 
Jupiter 1047.39 + 0.03 | 153 .584-L0 .00 | 696. 85-L0 .0 
Saturn 3499 + 4 7 .802+0 .01 18 .74-40 .0 
Uranus 22800 + 300 0 .141-+0 .00 0 57-40 .0 
Neptune 19500 + 300 0 .042 +0 .00 0 18-40 .0 
Solar oblateness — | 0 .010 +0 .02 0 .00 +-0 .0 
Moon — — | 7 .68-+0 .0 
General precession = 5025 .645+0 .50 | 5025 .65 +0 .5 

(Julian century, 

1850) 
Sum —- 5557.18 +0 .85 | 6179 .L +2 .5 
Observed motion —- 5599 .74 +0 .41 | 6183 .7 +1 .1 
Difference = 42 .56 +0 .94 4 6 ZI 
Relativity: effect — 43 .03 +0 .03 3.8 +0 .0 


(*) (m-1= reciprocal of the adopted masses, the sun’s mass taken as unity. 
Note that with improved astronomical observations and calculations (say by a factor of ten), 
the precession of Venus, Earth and Mars could also be checked with the theory. (Our notes.) 


The quantity A represents the angular distance, therefore, between two suc- 
cessive perihelions. We see that in the limit c-1 +0, À —0, in accordance 


with our relatvistic correspondence principle. 


Inserting the values of the 


various quantities above, we find for Mercury À & 43" per century. This esti- 
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mated precession is in good agreement with that found astronomically after 
the precession due to planetary perturbations has been taken into account. 
It was first calculated by LEVERRIER in the 19th century, who, on the basis 
of the discrepancy postulated the existence of another planet to account for 
the discrepancy. No such planet was found, and this constituted a funda- 
mental difficulty for Newtonian mechanics. The experimental situation is 
summarized in Tables II and III (from G. M. CLEMENCE: Rev. of Mod. 
Phys., 19 361 (1947)). 

Various attempts were made to modify the Newtonian theory, but with 
little success, since it is clearly unsatisfactory to change a theory as funda- 
mental as the Newtonian theory merely to predict one effect. With the advent 
of special relativity, it was appreciated that simply because of the increase 
of inertial mass with velocity there would be a precession. However, as we 
shall now show, using a scalar theory of gravitation, the precession is only 
È of the observed value and of the wrong sign. 

In accordance with special relativity and the hypothesis of a scalar gravi- 
tational field we postulate a variational principle of the form: 


(5.43) sfr dr +mU dr = 0, 


where dt? = di? — dw?— dy?— dz? and U is the gravitational potential U= 
=—GM/r for a «fixed source » of mass M. We have again the two conser- 
vation laws (upon introducing polar co-ordinates for dt? as above for ds?) 


(5.44) 


Substituting in the expression for dt? 


dt\2 /dr\2 dq\? 

5.45 E Peet oe ay fe 

PEA (È a ( (= , 

using (5.44), we find 

(5.46) (ae Oe Ke CEE Ko 
(2 


Now introducing «= and expanding the left-hand side we obtain 


(5.47) 1— 2GMu + G2M?u? = Ki — (FE) xt — Ku. 
dg 
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Differentiating and once again introducing ordinary units we have 


Gi 
RE 
(o ©) 


Î 


arty. o) y — EM 

dy? CUS CE 

We see the left hand side is the same as our previous eq. (5.39), except that 
the precession term is $ of the correct value and is of the wrong sign. 

This result constituted a serious difficulty, as we have indicated previously, 
for any attempts to build a theory of gravitation on a special relativistic 
basis. (*) This failure of an approach based on special relativity was to be 
expected, however, on more fundamental grounds. 


5'5. — The gravitational deflection of light. 


In Chapter III, using the principle of equivalence we calculated the deflec- 
tion of light by a gravitating body treating it as a Newtonian body of mass 
hy/c?. We now wish to perform the calculation using the correct expression 
for the line element, by directly solving the equations: 


(5.49) | das — 0, 


as = 0. 


If we examine the constants of the motion (5.29) we see that they both become 
infinite in the limit ds —0. Turning to the orbital eq. (5.37), we see that it 
only involves LC. and hence letting K,— 00, the equation becomes 


d2u 3GM 
= EE 


ni 
dg? e? 


(5.50) 


Now in the limit c-1-0, this is the equation of a straight line, similarly in 
the limit G—0. Thus the effect of the gravitational field is to destroy the 
Euclidean straight-line character of the propagation of light rays in free space. 
The null geodesics ds — 0 being those which satisfy the non-linear eq. (5.50). 

To solve this equation to desired accuracy we set as a first approximation, 
U = R- cosp, where R is the distance of closest approach to the gravitating 
body. We have therefore u=%-+4,-+..., substituting this in (5.50) we find 


du, 3G M 


(*) For a discussion of more complicated attempts see e.g., S. N. Gupta: Rev. 
Mod. Phys., 29, 334 (1957). 
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Trying a solution of the form w, — A sin? p+B we obtain A= G?M°?/c°R?, 
B= A, and « is therefore given by 


al GMs. 
(5.52) = R COS pg + ca Ri (sin? @ +1). 


Along the asymptotes we have as before 


ll GM. 
0 — R COS Pa + ER? (sin? Pi + 1) ; 
(5.53) 


Lat GM as, 95 aa 1 
Dre OO mig pe Pot 1), 


Now since gy, = (x/2) +1, where A is small, substituting we find A = 26M /c?k? 
and the total deflection is 

‘ 4G M 
or twice the total deflection found previously. It is interesting to note also: 
that the velocity of the light does not increase as it approaches the sun as for 
a Newtonian particle, but diminishes. This follows from taking the ratio of 
the two expressions in (5.29); we have 


dy a 2GM 
55 i à 
(0:00) bui r }» 


and for r= À, this becomes 


dp K 26M 
5.56 Re ? | : 
eee) DI RO : R 


But on the other hand at r=R from the line element, since dr/dt=0 we 
have 


/ 


do\? 2GM 
5.57 re (St)'_ es 
(5.57) + 1 eas 


Hence eliminating dg/dt and solving for K ,/K, and then restoring units we find 


(5.58) pa Re (i gat 
dt (1—(2@M/Re2))t re? 


Thus at perihelion the areal velocity is not Ke in terms of a clock at infinity but 


(5.59) pe IP _ Re (1 NG xy. 
dt ve? 
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Although the local velocity in terms of a clock at rest at R is of course e 1.6. 


E dp 
(1—(2GM/Re2))} di 


(5.60) Cc. 


The experimental situation on the deflection of light is summarized in the 
following table taken from R. J. TRUMPLER (op. cit., p. 108). 


TABLE IV. — Observation of light deflection. 


| Focal | Light 
Eclipse Station | length No. SEE Mesh: Dellection af p-e. | Observers 
(feet) | plates stars (005 OS ES Er 
1919 May 29 | Sobral 19 7 7 relat. | 1.98 | +.12 | Dyson 
Sobral 11 16 6— 12 |relat. | (0.86) | +.1 DAVIDSON 
Principe 18 2 5 relate | 1661 |) s2.3 EDDINGTON 
1922 Sept. 21 | Wallal 15 4 | 62+ 85|relat. | 1.72 | +.11 | CampsELL 
5 6 134-143 | relat. | 1.82 | +.15 | TRUMPLER 
Wallal 10 2 18 relat TRES | CHANT, YOUNG 
HA | 5 2 Jato relat 101,77) lato tp Gan se 
Downs “pa DODWELL 
1929 May 9 Takengon 28 4 17= 18 | absol. | (2.24) | FREUNDLICH 
relat. | 1.75 | +.13 | v. KLUBER 
| v. BRUNN 
1947 May 20 |Bocaiuva 20 1 51 relat. | 2.01 | +.18 | v. BIESBROEK 
1952 Febr. 25 | Khartoum 20 9) 9-- 11 | absol.| 1.70 | +.07 | v. BIESBROEK 


Prediction by Relativity Theory 1.75. 


These two results (precession of the perihelion of Mercury, together with 
the deflection of light) constitute the other checks of general relativity beyond 
the change of a clock’s rate in a gravitational field discussed earlier, and which 
follow immediately from our expression for ds. 

One customarily speaks of these effects as the «three checks » of general 
relativity. It should be noted that the precession of the perihelion of Mercury 
is a very severe check on the theory since it is of order Gin the calculations, the 
other effects being of order G.. It is to be hoped that the development of ex- 
perimental techniques in the coming years will make it possible to verify 
this precession effect for other planets and satellites, and also open new ways 
to measure effects of order G?. 
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5:6. — Some orbital formulae common to Newtonian mechanics and general relativity. 


Another «check » of general relativity not often discussed appears to be 
the remarkable similarity of certain expressions obtained in the theory with 
those of Newtonian mechanics. We have already commented previously on 
this in connection with the field equations. Two striking instances occur in 
the equations of motion in addition to the Keplerian areal law mentioned pre- 
viously : 

1) For a particle falling radially, the acceleration is given by 
dr GM 


ds? r2 


(5.61) . 
where of course ds measures the interval of time read by a clock falling with 
the particle. We see that if one measures time by such a freely-falling clock, 
and «distance » by the invariant transverse measure (radius of curvature of 
the spherical shells through which the particle-clock is falling), then the for- 
mula is identical with Newtonian mechanics. Moreover a Newtonian could 
not object to this interpretation of the law since these constitute alternative 
and admissible ways to make the measurement. 


2) Let a particle be undergoing circular motion, for which d?r/ds? = 0, 
then from the geodesic equations of motion we have 


(5.62) Go dt’ + Don dg” = 0 
or 

vi: Lora CA 

(5.63) (SE) = pino Toni 


in other words Kepler’s third law holds exactly. This latter result would appear 
at first glance simply a coincidence due to our choice of co-ordinates (see e.g. 
EDDINGTON, p. 89). However a more careful inspection reveals that this ex- 
pression is also identical with he which in turn is one of the diagonal com- 
ponents of the Riemann curvature tensor expressed in canonical form for the 
Schwarzschild field (see the discussion of the Petrov classification of the Rie- 
mann tensor given in F. A. E. PIRANI: Phys. Rev., 105, 1089 (1957)). The re- 
lationship therefore has fundamental significance. Similarly, upon computing 


d ey 2GM 


dr \ds2/ 3 


we find this is the same as R”,. 
It would therefore appear from the above that a geometer could show that 
in formulating Newtonian mechanics, the Newtonian was using more as- 


414 


AN INTRODUCTION TO THE GENERAL THEORY OF RELATIVITY 65 


sumptions in his space-time geometry than were necessary, and if we strike 
away some of these asumptions (e.g. the rate of a clock in motion is the same 
as a clock at rest; the co-ordinate «7» represents proper radial distance as 
well as radius of curvature), the resulting assumptions (or conclusions based on 
observations) agree with those of general relativity. In other words, there 
is a residual body of statements common to both theories, just as is the case 
for Euclidean and non-Euclidean geometry. To the extent that a particular 
result depends only on what the two theories have in common, the results will 
be the same. The above two results appear to be of this character. 

Needless to say, such a «check» of general relativity is of an aesthetic 
or logico-geometrical character rather than a direct empirical one; but in view 
of the inherent methodological difficulties of establishing any theory by empi- 
rical means alone and particularly when the predictions are so few in terms 
of what is experimentally possible (in contrast with quantum mechanics), it 
appears highly desirable to have supplementary means for criticizing and 
evaluating the many (not very successful) alternative theories to general rela- 
tivity that have been proposed, as well to sharpen our understanding of the 
general theory itself. 

However, the severest «check » of the theory will be whether or not it 
proves useful in helping us to arrive at a more unified and experimentally 
verifiable picture of microphysics and cosmology—to which we now turn. 


CHAPTER VI. 


MACH’S PRINCIPLE AND COSMOLOGY 


61. — The clock paradox in the presence of a permanent gravitational field. 


In the previous chapter we have discussed the «classical » three empirical 
checks of general relativity together with checks of a more mathematical cha- 
racter. However if these were the only kinds of checks of the theory it would 
be unsatisfactory: some checks of a cosmological character are needed. 

The reason for this is the following: the conceptual basis of relativity lies 
in the assumption that motion is an entirely relative concept, and hence it 
is only meaningful to speak of the motion of a body (*) provided there are other 


(*) The concept of a « body » is of course vague; but we usually mean something 
we can observe either directly or indirectly by physical means and which participates 
in the Laws of Nature in terms of energy balance, etc. Usually we mean something 
composed of atoms or the «elementary particles », but «fields » also may be subsumed 
in this definition. 
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bodies relative to which the motion may be referred. If we now consider, 
e.g. the famous clock paradox (*), in special relativity, we encounter a difficulty 
with respect to the relativity of motion unless there are other bodies in the 
universe besides the two clocks involved in the problem. That is to say, the 
asymmetry observed in the readings of the clocks is to be attributed to the 
fact that one of the clocks was accelerated relative to the other bodies in the 
universe. 

One might at first ask: Why relative to the other bodies in the universe, 
why not consider acceleration as absolute? The answer to the latter question 
is twofold: a) if we adopt this solution we are giving up relativity, and b) this 
solution, as we shall now show, is untenable from the standpoint of gravitation 
and the principle of equivalence. 

The proof runs as follows: 

First we remind the reader that according to the principle of equivalence 
an observer in uniform acceleration cannot decide locally whether: 


a) he is actually at rest on the surface of a gravitating body, or 


b) in «free space » accelerating relative to the distant stars and in the 
limit as the acceleration vanishes (as measured locally by an acce- 
lerometer) he cannot determine whether 


a) he is freely falling in a gravitational field, and hence accelerating re- 
lative to the distant stars, or 


b) in «free space » translating uniformly relative to the distant stars. 


We now use these ideas to construct a situation in which we get the «re- 
verse » of the usual clock paradox. 
Consider as in the diagram, a gravitating body 


Fig. 5. - Spherical body with hole 

drilled through it. Clock B falls 

through the hole to the antipodal 

point and returns to clock A at 

rest on the surface where they 
are compared. 


> 


(*) Briefly stated the paradox is the following: we have two similar clocks A and B 
at rest relative to one another so that At,=At, (i.e. a time interval of clock A=time 
interval of clock B). Let one clock now be moved or accelerated away from the other 
(in such a way that an accelerometer attached to one clock reads zero, while that 
attached to the other reads different from zero) and in such a way that it finally returns 
to the other clock. Then in general the clocks will not read the Same, and in special 
relativity, the clock whose accelerometer reads zero throughout the experiment will 
read a greater elapsed time than the other clock. 


416 


AN INTRODUCTION TO THE GENERAL THEORY OF RELATIVITY 67 


with a hole drilled through the center from one antipode to the other. And 
let there be two clocks A and B with accelerometers attached. Now let clock A 
remain at rest at the surface of the hole and let B fall through the hole to the 
other antipode and then back to A where the clocks are compared. One can 
easily show by direct calculation (*) (this will be done by us in a paper to 
appear later this year) that the clock B will read less than the clock A upon 
its return. Yet, it was clock A, whose accelerometer read different from zero 
during the motion. From the standpoint of absolute acceleration (2.6. in terms 
of the reading of an accelerometer) the result is incredible, since from this view- 
point one would argue that clock B was the clock that constituted an inertial 
frame, and clock A, the one that did not, since it appeared to be accelerated 
as determined by its accelerometer readings. One would therefore conclude 
that clock A should read less than clock B which is incorrect. 

On the other hand, if we compute the time difference between clock A and 
clock B for the situation where clock B instead of falling through the hole, 
is hurled outward from the mass, and with a sufficiently small velocity, so 
that it returns, then we find that the elapsed time read by clock A is less than 
the elapsed time indicated by B (as one would expect from a crude application 
of special relativity) (“). (In making the comparison we synchronize B with As 
as B passes A, and after B has acquired initial acceleration, so that B is in 
free fall throughout the measurement). 

By these considerations we are led to conclude that we cannot in general 
predict which of two clocks will read greater or less elapsed time after a rela- 
tive round trip voyage. The relative readings of the two clocks will not be 
determined by their local apparent acceleration but by their integrated state 
of motion relative to the other bodies in the universe acting as sources of the 
gravitational field or metric. 

In the above example, the «other bodies of the universe», were replaced 
by a large central field, presumably in the case of special relativity these other 
bodies, are the so-called « fixed-stars » whose net effect locally is to produce the 
special-relativity metric or the basic inertial frane, when local perturbations 


(*) Using Schwarzschild’s interior solution (see MoLLER, p. 330), 


--——— dr? RI a 
ds? = {A ENI ze 7/1262 di? — 1_ r/R — r2(d02 + sin? 0dg?), 


A 3V1 — r8/R? 3 =, RAS) 0305 r, = radius of sphere. 


(**) The above results may be calculated to a sufficient approximation using the 
« flat earth » line element with ds? = (1-++ 2az) d!?— go) de?— dr? — dy?, a = GM/r5. In this 
co-ordinate system, for radial motion, the radial acceleration satisfies d?2/ds? = —a, 
so that calculations are very simple. See also W. CockRan: Vistas in Astronomy, 
(Edited by A. Brrr) London, 1960, p. 78. 
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are neglected. In this case à well-defined asymmetry exists between the two 
clocks in the clock-paradox during the round trip and there is no contradiction. 
However this «explanation » makes sense only if the other bodies in the uni- 
verse do in fact give rise to the Minkowski metric (to a good approximation) 
although, as we shall see, this assumption is not free of objection. Nevertheless, 
the assumption that the basic inertial frame is determined by some over-all 
effect of mass receives support from the fact that strong local concentrations 
of matter can seriously distort the inertial frame to the extent that uniform 
motion in a straight line becomes e.g. uniform circular motion (motion in a 
Schwarzschild field), or oscillatory motion on a straight line (such as the clock 
oscillating back and forth for the interior solution). 


62. — Mach’s principle. 


The assumption that the distant stars define the basic inertial frame and 
that inertial effects therefore arise as a consequence of accelerations relative 
to the distant stars is called « Mach’s principle ». MAcH of course arrived at 
this idea, before the formulation of relativity theory, by reconsidering the 
famous argument given by NEWTON that rotation must be considered absolute. 

NEWTON pointed out that if we consider some water in a rotating pail, 


Pail in rotation. Pail “at rest. 
Water at rest. Water in rotation. 


Fig. 6. 


before the pail has transferred angular momentum to the water, the surface 
is horizontal, while after the pail has come to rest, and the angular momen- 
tum transferred to the water, the surface assumes a concave form. Now clearly 
this flatness or concavity of the water’s surface has nothing to do with the 
relative rotation of water and pail, since in both cases, they are in relative 
rotation. 

The cause therefore of the difference must be that the state of rotation 
(or rest) is with respect to some third body (or bodies). For NEWTON this third 
body was absolute space, however, MAcH argued otherwise: suppose we ima- 
gine the pail to become larger and larger, that is to Say, more massive, and 
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in fact, of the same dimensions and mass as the celestial bodies themselves, 
would it still be true that the relative rotation would play no role as to whether 
the surface of the water assumed a flat or concave shape? In other words, 
may not the distant stars in some manner play the role of the «third body »? 

In Newtonian mechanics the answer is of course, «no», since the dyna- 
mical laws are formulated with respect to an absolute frame (to within a uni- 
form translation 2.e. invariance with respect to the Galilean group), and the 
only influence of the distant stars is to change the gravitational potential 
in the neighborhood of the rotating pail. No such effects envisaged by MAcH 
are to be expected nor could be understood. 

On the other hand, when we turn to general relativity there is such a 
possibility because of the tensorial character of the field. Indeed, THIRRING 
(Phys. Zeits., 19, 33 (1918)) showed that if we have a rotating shell in 
general relativity, then the gravitational field produced inside the shell has a 
metric of the following form (using the linearized approximation to the field 
equations) 


1 DC as 5R2~?2 | a ee x 
Ty = Se a x y? 22? 
] Re? 3e | 6 no. DE 


4 GM OY 
HS Re 0 ? 
4GM x 
Joo = ASSO Pee 


930 — 0, 


together with corrections to g;; which may neglect. 

As a consequence of that this solution we see that from the terms in g, there 
will be a centrifugal force oc (¢M/c?R)w?0, 0? = a?+y?, as well as Coriolis 
forces, from the terms 91, Jo. Thus an observer 


inside the shell, at least in the plane 2=0, | 
would experience analogous effects to these expe- Prec 


rienced by an observer on a rotating dise. In < rest mass Mg 


particular if we put a small sphere of fluid at the 
center of the sphere, it would assume an ellipsoid- 
al shape, as if it were rotating relative to 
Newton’s absolute space. 
However, it is clear, this model of Thirring XE 
cannot fully explain Mach’s principle, since we Fig. 7. 
see that there are in g terms involving 2 which 
do not occur in a rotating system. Thus if we place a particle on the 2-axis 
above the origin it will experience an inward directed acceleration. The reason for 
this has sometimes been attributed to the linear approximation used, but more 
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likely it is due to the neglect of the stress system in the sphere which main- 
tains its spherical shape. Certainly however, the fact that the rotating sphere 
itself would tend to assume an elliptical shape means that we have not really 
got at the full explanation, since in the Thirring model there is no other matter 
in the universe except the sphere, %.e., the energy-momentum tensor 7°, only 
involves the sphere. Another difficulty is related to the fact that the Thirring 
model is intended for speeds v< c, whereas we have seen in Chapter II, that 
the kinematic or apparent velocities of the stars corresponds to v>e. 

If we examine the structure of the field equations given previously, it is 
not difficult to see where the fundamental trouble lies. Thus in the absence 
of matter we have 

Cl = Ik 


uv “pv 


— 39, = 0 


for which g,,=#, (of special relativity) is a perfectly valid solution. How- 
ever in this circumstance the equations of motion of a particle are obtained 
from of dx dx")? — 0 and if we transform to a rotating frame, we find the 
usual centrifugal and Coriolis forces, thus a particle can exhibit inertia even 
in the absence of distant masses and this is in clear contradiction with Mach’s 
principle. 

There is at present no accepted answer to this dilemma. We have attempted 
recently (*) a solution based on using the cosmological term Ag,, Which can 
be added to the field equations, since Ag um» = 0, however we prefer in these 
lectures to leave this point open as a matter of speculation for the reader and 
proceed to some of the other problems of cosmology. 


6°3. — Olber’s paradox and the finiteness of the universe. 


Consider now the classical questions: «Is the universe finite or infinite? » 
« Are there a finite or infinite number of stars? » A simplified approach to these 
difficult questions is that given by OLBERS (1826) (**). Let us suppose: 1) we 
have an Euclidean universe with an infinite number of stars, 2) the stars are 
distributed with a uniform density 0, 3) every star has the same lumino- 
sity, 4) the stars have been radiating and absorbing light for an infinite time 
into the past. With these assumptions we are led to the conclusion that the 
sky must be infinitely bright in contradiction with observation. 


() Nuovo Cimento, 20, 1 (1961). 

(©) For a more detailed account see BONDI, p. 19. Our presentation of this paradox 
differs somewhat from BonNpr's one. In a recent review article, O. STRUVE (Physics 
Today, 14, 54 (1961)) has called attention to an earlier statement of the paradox by 
L. DE CHESEAUX (1744). 
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To see this consider a series of spherical shells taken about a given point. 
Then the total number of stars in a given shell is 


AN = 4x or? Ar 
and the intensity of light received at the origin from the shell will be 
Al = ko Ar, 


where % is a luminosity coefficient. Hence integrating we have 


foo} 


(6.1) I= kefar= 00, 


0 


in performing this integration we have of course assumed 4), since as 7 — oo, 
because of the finite propagation-time of light, t=r/c, t oo. 

Now the fact that we obtain infinity above is of course no great mystery. 
From our assumption 4) it follows a star must have radiated an infinite amount 
of energy. If the system is to be in statistical equilibrium it must have also 
received an infinite amount of energy, which is precisely what the above for- 
mula tells us. 

Now of course there are many ways out of this difficulty. One way is to 
introduce a non-uniform density 0= 0(7) such that 


(6.2) Jew drhnite, 
while still permitting the total number of stars to be infinite (*) 
(6.3) ta fol rdrissnanite. 


For example setting 0= 00(14-(r*/R?))-!, where R is some fundamental cosmo- 
logical length. The difficulty with this model is of course the problem of con- 
servation of energy. After all how can a star continue to radiate forever with 
a constant luminosity? We can calculate life times for stars and they are ot 
the order of — 101 years (this of course was not known during Olbers time). 
Let us now assume that the stars have a finite lifetime 7 and for simplicity 
they radiate with constant luminosity during that period. Then it is clear 
that we can dispose of the infinity in (6.1), by the following assumption: 


(*) This is a simple example of the so-called hierarchical models. 
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{') The stars were all created simultaneously (with a uniform density 


x 


throughout all of space) at a time f,< 7 in the past. 


With 4) the total intensity received will be 
(6.4) I= koch. 


There are of course many difficulties with this hypothesis from the standpoint of 
our usual physical ideas, nevertheless as we shall now see all other present 
cosmologies encounter similar difficulties either at a singular point in space 
at an initial time, or at singular points throughout all space for all times. 
In any case if we make rough estimates for the above quantities k, o, and also 
o the fact that the intensity of starlight is 10-?erg/cm?-s we find 

= (105-101) years in agreement with above. As we shall see there are other 
OA of arriving at this magnitude of time. 


6°4. — Various cosmological models. 


The possibility that space is infinite and that there is an infinite number of 
stars created at a finite time ¢ in the past is of course unattractive because 
of the singling out of time over space and also the difficulties in imagining how 
such a strange event could have ever occurred. One might therefore imagine 
a model in which space is finite (for example spherical) and static such as was 
envisaged in an early model by Einstein. a 

We imagine the stars or nebulae distributed throughout space, just as dots 
on the surface of a 3-sphere. We therefore have a finite number of stars. 
However even this is not sufficient to guarantee the low brightness of the 
night sky, unless the stars have been losing energy effectively for only a finite 
period, since otherwise in such an «enclosure », the radiation content would 
have continued to increase until the background was as bright as the stars. 

The difficulty with this model is similar to that of the infinite distribution 
model (created at time £ in the past), except that one has a finite number 
created simultaneously at £ distributed throughout a finite space. This singling 
out of time as a singular point in preference to space however is unsatisfactory 
from the standpoint of relativity and one would prefer to find space itself 
also become singular when time did. Proceeding in this way we arrive at the 
so-called expanding (or explosion theory) models of the universe. In these 
models we imagine matter or energy to have been highly concentrated at some 
time in the past. Since from the standpoint of Hinstein’s field equations the 
distribution of energy to some extent determines the structure of space itself (*), 


() As we have remarked before it does not do this entirely, e.g. even if Ta 
we have many solutions possible to G,,=0; flat space, hyperbolic space, ete. 


E 
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we may therefore look upon this singular point as in some sense the beginning 
of space, time and matter. After some time interval At, the system will have 
Spatial volume AV, and in energy density o so that thereafter we can apply 
our physical laws. 

There are (at least) two objections against this model; one, physical, the 
other, philosophical. The physical objection concerns itself with the problem 
of anti-matter. We know from experiment that there are anti-protons, anti- 
neutrons, positrons, ete., and the question is, «what became of these anti- 
particles? » 

Part of the reason for the unsolved character of this difficulty is that it 
was not contemplated when these models were first formulated (1920's). In- 
deed, even in the early 1930’s, people thought that the electron was in some 
sense the anti-proton, until it was shown both experimentally and theoretically 
that this could not be the case. When the question was finally raised in con- 
nection with cosmology no one could give a satisfactory answer, beyond spe- 
culating that perhaps the antimatter was in another part of the universe (for 
example on the other side of the «sphere», if the universe is spherical). 
Although no satisfactory explanation has ever been given as to how it got 
there, 7.e., how the two kinds of matter become separated. One might specu- 
late that this has something to do with charge conjugation and parity viola- 
tions; but it is still quite an open question. Before going on to the philosph- 
ical difficulty it might be well to call attention to an early difficulty which no 
longer exists due to a change in the experimental data. In the 1930’s HUBBLE 
estimated from observing the red shift associated with the distant nebulae, 
an «age» of the universe of about 10° years. (The approximate formula is 


is de ) 

red shift — (Hubbles costant) x SAN 
dÀ r 
ese 


The reciprocal of Hubble’s constant, HT is therefore a measure of the age 
of the universe if the red shift is interpreted as a Doppler effect due to an 
expansion process. We shall discuss various relativistic models in greater detail 
anon.) 

On the other hand the age of the earth as determined from radioactive decay 
processes turned out to be of order 5-10° years, and a similar figure from me- 
teorites. This discrepancy persisted for over 20 years (1934-1955) and led to 
a great many erroneous conclusions, in particular that relativistic cosmology 
was in disagreement with experiment. However recent measurements of SAN- 
DAGE (*) and others would tend to set the value of Hubble’s age parameter 


(*) Astrophys. Journ., 127, 513 (1958). 
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at > 6-10° years and even as much as ~ 2-101 years, so that there is no longer 
a difficulty (*). 

We therefore turn to the philosophical difficulty. Various people such 
as BONDI and GoLp, and particularly F. HoyLE have argued that it is philo- 
sophically more satisfying if the universe did not start from a singularity in 
space and time, but actually is «continuously created », in accordance with 
the so-called «Perfect Cosmological Principle» According to this prin- 
ciple (or really, hypothesis), the universe presents essentially the same aspect 
throughout all time and throughout all space. That is to say, the universe 
is stationary. The importance of this stationary or steady state property, is 
that there is no unique point in time singled out as «the beginning », which 
could never be explored by means of experiment. Now, as we have seen from 
our discussion of Olbers paradox, we encounter difficulties if we have a uni- 
verse infinite in space and time with a uniform distribution of matter (which 
must be the case according to the perfect cosmological principle). The way 
out of these difficulties according to the steady-state hypothesis is to assume, 
(in agreement with observation) that the universe is expanding, or to put it 
ditferently, the distances between nebulae are increasing with time. Now be- 
cause of the Doppler shift, the light coming from the more distant nebulae 
will be shifted towards the red, and if we assume this shift is just proportional 
to distance, then this will have the result of a finite intensity of light. Indeed 
in the steady-state model the line element is taken to be 


ds? = dt? — exp [2t/T](dx? + dy? + dz?) , 


where 7 is the reciprocal of Hubble’s constant, and is a measure of the age 
of the universe (in non steady-state models). One can show setting ds=0, 
that the red shift for light of wavelength 2 coming from a nebula of distance r, 
will be 


OASI 

Ja 

Now of course if the distance between nebulae increases with time, the density 
of nebulae will diminish and this is not in agreement with the perfect cosmo- 
logical principle. They therefore are led to assume that matter is constantly 
appearing in interstellar space so as to preserve a constant density. It is for 
this reason one sometimes refers to this model as the «continuous creation » 
model. It is to be emphasized that there is no direct experimental evidence in 
favor of this hypothesis. On the other hand, there are grave difficulties with 
micro-physics. Thus HoyLE assumes that hydrogen atoms are continually ap- 


(") See however, A. SANDAGE: Ap. J., 133, 355 (1961). Thereis now a discrepancy 
with stellar age estimates. 
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pearing in interstellar spaces, which then condense to form stars and nebulae, 
the nebulae gradually separating and new hydrogen appearing. 

One of the major difficulties with this hypothesis is that it violates our 
usual microphysical conservation laws: conservation of baryons, conser- 
vation of leptons, local conservation of energy, angular momentum, etc. 

The fact that the rate of production of hydrogen according to HoYLE is 
small 

Am 30. : 
Ares nie LO= fe ome S|. 

is irrelevant, since the above conservation laws are supposed to be « strict » 
conservation laws. Indeed about the only thing one isn’t giving up in such 
a model is the conservation of charge and possibly angular momentum if one 
assumes the hydrogen is « created » in an s-state with the spins of the proton 
and electron oppositely aligned. One might try to conserve baryon and lepton 
number by assuming the simultaneous production of neutrons and anti-neutrons 
but then it would be difficult to understand why the anti-neutrons and 
neutrons do not annihilate with one another. 

Thus although the steady state model disposes of Olbers paradox and pre- 
sumably satisfies the perfect cosmological principle, it gives rise to a great 
many difficulties with respect to microscopic conservation laws. Since the 
number of things one has been able to predict and explain using these laws 
is so enormously greater than what one has been able to measure and predict 
according to the steady-state hypothesis, we cannot help but feel some hesita- 
tion towards accepting it. However within the next decade the observation 
of nebulae red shift, nebulae count, density of matter in the universe, stellar 
populations, relative abundance of the elements, etc., together with further 
developments of our knowledge of elementary particle physics should enable us 
to decide one way or the other on this bold hypothesis. 


65. — Summary of general relativistic cosmological models. 


In the preceding sections we have tried to outline some of the principal 
problems in formulating cosmological models. As we have seen they are rather 
formidable when we do not ignore the fundamental questions that must be 
asked and answered if such models are to have any profound significance. Let 
us now, however, for pedagogical purposes, ignore some of these more difficult 
questions and attempt to summarize the mathematical scheme used in general 
relativistic cosmologies. 

First, what constitutes the primary observational data? As we have re- 
marked before, a study of distant galaxies reveals a shift in the spectral lines 
A7/) which increases very nearly linearly with distance and is apparently 
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isotropic. Needless to say, when one says « proportional to distance », the 
question arises as to just how one measures distance, since we have no direct 
way of doing this. The present method consists in making the assumption 
that the absolute luminosity of an average nebula is always the same (*). There 
is a wealth of problems connected with obtaining reliable distance measures 
(e.g. for near nebulae using the period luminosity relation for cepheid variable 
stars) but from our standpoint the most important theoretical point is to 
note that the method essentially involves the assumption that the universe 
is homogeneous, otherwise we should have no way of being able to infer from 
the photographic magnitudes of the distant galaxies, their absolute magnitudes 
and hence distance. 

We therefore commence to build our general relativistic cosmology on the 
following hypotheses: 


1) Space is isotropic. 
2) Space is homogeneous. 


3) The field equations of general relativity are valid for determining 
the metric. 


However these hypotheses are obviously inadequate, since 


a) There are two forms of the field equations: 


I sia 39, E aa xT 


pv? 


Lis = 29k aa ee «I, pa 19% x 
b) We don’t know what to insert for T,,, and in the second case what 
value to choose for A. 


To proceed further therefore we make some simplifying hypotheses con- 
cerning 7°,,: namely, that we can pretend the galaxies have been smeared out 
so that they form a «dust» with an energy momentum tensor of the form 


dz’ de, 
le GIS 


Ty, = (00 + D) OP 


where p is a scalar pressure and 0, a density of energy (or mass). Sometimes. 
the energy momentum tensor for the electromagnetic field is added, although 


() Apart from the so-called Stebbins-Whitford reddening which would imply a 
dependence of luminosity on time (which incidentally is excluded in the steady-state 
theories) and indeed an additional redding as we progress backwards in time to more 


distant nebulae. Recent observations tend to make this additional shift very small, 
if it is present at all. 
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its contribution is presumed small since the mean density of the «dust » is 
of order (10-10) g/em*, whereas the comparable density of radiant energy 
is considerably smaller and therefore can be neglected in this approximate 
type of calculation. To make things even simpler one goes further and as- 
sumes that the kinetic energy of the dust is small so that all one has left is 
0, p so that terms of the form 


One chooses a simple co-ordinate system to express the line element using 
hypotheses 1) and 2) and the following assumptions: 


4) Clocks at rest with respect to the dust keep time at the same rate 
independently of position and elapsed time so that there is effectively 
an absolute time read by all clocks to within a synchronization of 
the zero point. 


5) The time (measured with such clocks) for light to propagate from a 
point A of the dust to a point B is the same as from B to A, if the 
zero point of time is taken to be the same. 


Remark: note that these two assumptions are in accordance with the homo- 
geneity and isotropy property of space extended to the time domain. Indeed 
when the astronomers assume the absolute magnitude of average nebulae to 
be the same, since the light from these nebulae is coming from the past as 
well as distant points in space, one is effectively assuming the validity of 4). 
Needless to say, one might make other assumptions such as that the rate of 
a clock in the past was slower than the same clock now and attempt to inter- 
pret the red shift this way. But we shall not consider such approaches here 
as they entail a great deal of complications. 


It follows from 4) and 5) that the line element can be written in the form (*) 
ds? = dt?— g;;dx' dx. 


When we do this we are using the so-called « co-moving » co-ordinates which 
may be visualized in the following way: Imagine for simplicity space to be 
closed and the nebulae on the surface of a 3-sphere with radius A(t), where 
R(t) is a single-valued function of time which we shall assume to be increasing. 


(*) That the line element can be always written in this form follows from general 
covariance and mathematics, but it is important to clarify the physical reasons why 
we choose to express the line element in this form and work in this co-ordinate system. 
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Approx. 
light path 
from B to A’ 


Space at 

t=constant 

great circle 
section 


Consider now two nebulae A and B at rest relative to the surface of the sphere 
with radius R(t), then this will hold for all times even though they become 
more and more separated. Thus the «spatial part » of the line element will 
be of the form 


de h(t)-dos, 


where do? does not depend on time but on the geometry of the space. Also 
by the above assumptions we have 


(6.5) ds? = di? dd? = dé? — R(t)*do?, 


so that the proper time of a clock at rest relative to the surface of radius R(t) 
is simply the co-ordinate time. We see also that terms of the form gp; dtda* 
do not appear because of the symmetry in the above picture. Also the fact that 
the proper time equals the coordinate time is clear, for suppose goo=Yoo(t, 4°). 
Now because of the rotational invariance it could only depend on the angle 
between the lines OA, OB, so goo = goo(6, t), however what would we choose 
for 0? There is no preferred nebula. Hence, we set goo = got). But this 
implies that the rate of a clock depends on time, in disagreement 
with (4), hence we integrate to obtain a time t=|V/ g,,(t') dt’, which we call 
«cosmic time ». Note that this cosmic time is in some sense a Newtonian con- 
cept since we are saying time flows uniformly for observers at rest with respect 
to the sphere of galaxies. However this idea is fundamental to special rela- 
tivity: two clocks at rest with respect to a given Lorentz frame keep time 
(uniformly) at the same rate. Finally, for simplicity in the above discussion 
we have assumed a closed spherical space, but a hyperbolic or Euclidean space 
could also have been used as well. 

Indeed it has been shown by the mathematicians (Schur’s theorem) that 
the only homogeneous isotropic spaces are those of constant curvature with 
the curvature positive (spherical or elliptic space), zero (Euclidean or « para- 
bolic »), negative (hyperbolic). 
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The spatial line element for such spaces may always be written (in isotropic 
form) 


and hence the total line element is of the form 


2 dr? sr r°(d6? + sin? 6 dq?) 
[1 + (k/4)r2}? ‘ 


ds? = dé? — Rit) 


For k=-+1, we have spherical space, k= 0, Euclidean, k —— 1, hyperbolic. 
We see therefore from the standpoint of the line element there are two quan- 
tities to determine: 


a) the curvature of the space k, 


b) the function R(t). 


We now seek to relate these quantities to the energy-momentum tensor 
which we take in accordance with our previous assumptions. Substituting 
the above values for g,, in the field equations we find: 


k+ È? 
3 CR? 


= A+ xolt), 


3 RRL Rte 4 xv 
c2R2 C= 


We therefore arrive at a set of simultaneous equations to solve. Unfortu- 
nately to determine R(t) we need to know the four quantities k, A, o(t), p(t), 
none of which are known! A rigorous solution is therefore impossible. All 
we can do is to obtain solutions for a variety of cases and try to check these 
as well as we can with observations. 

The first simplifying assumption is that the pressure term is negligible. 
This would not be true for radiation since p=t4oc? but the radiation den- 
sity is estimated to be much smaller (~10~°) than the smeared out matter 
density, so this is no difficulty at least at this stage of development. For highly 
condensed states, at earlier periods, this assumption may be invalid. The 
covariant energy-conservation law is 


av—4 n _ 1 Gag IR ENTO, 


dat 2 on” 


Since we have only one component for 7“, namely 1%, because of the above 


(eg 
a 
A 
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form for the metric, it follows that this conservation law reduces to 


D 


(Ri) e(}) = 0, 


D 
> 


or 


R(t)5 o(t) = const. 


Following standard terminology we define this constant so that 


where Æ is a measure of the total energy of the system (in suitable units), and 
is a positive quantity. 
Then the above set of equations may be rewritten as (*) 
E e A 


za 


We now consider various cases corresponding to the three values of k, dif- 
ferent values of À (relative to Æ). As can be seen on the next page there are 
a large variety of cases from which to choose, depending on one’s philosophical 
predilections and interpretation of the data. Thus for all three cases (k = +1, 
0, — 1) there are the so-called oscillating models. At some time ¢, in the past 
space collapses (R(t) —0) and the density o(t) becomes infinite. In many re- 
spects this is very difficult to visualize if k = 0, —1, since we don’t have a 
closed space. On the other hand, for a closed space X=1, such a condensa- 
tion presents no difficulty. Other cases for sufficiently large A correspond to 
monotonically expanding universes. There is no agreement on the data at 
the present although the model of Hoyle—flat, ever expanding—appears un- 
likely. 

The double line separates the oscillatory models from the continuously 
expanding ones. 

Historically, the first general relativistic model proposed (1917) was a static 
one, Einstein’s universe with R=0, X=1. It was in this model that the 
cosmological term was first introduced, with the hope that it would also serve 
to exclude solutions for T,,=0 in conformity with Mach’s principle. This 
hope was shown to be unjustified by de Sitter shortly afterwards, who obtained 
the solution, corresponding in this co-moving co-ordinate System (G. LEMAITRE, 


() The second equation may be obtained by differentiation of the above equation 
with p(t) =0. 
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x A<0 (SATO RIO <A enza 
(1) 
3 k=41 R(t) R(t) R(t) 
(2) 
a! = | E SS 
=O} {ACL) R(t) R(t) 
bi» Il — — 
k=-1| | RCE) R(t) R(t) 
t— Il t ons 


Fig. 9. — Behavior of R(t) for various model universes. 


H. P. ROBERTSON) to H=0O (ie. no matter present) and %=0, with 
R(t) = exp[t/T], T=~V3/A. The situation was greatly clarified by the wokr 
of FRIEDMANN (1922). The fact that one could obtain closed universes with- 
out the cosmological term, while maintaining a reasonable structure for the 
T,, led EINSTEIN to later conclude (1931) that the cosmological term should 
be discarded—on the basis of simplicity. The difficulty then remains that if 
one sets T,,— 0, the field equations admit the Minkowski line element as a 
valid solution, which, as indicated previously, is very puzzling from the stand 
point of Mach’s principle. 


6°6. — Calculation of red-shift from line element for model universe. 


Let us now see how we may obtain a red shift versus distance relation from 
the above line element and how it may be used to decide on the preceding 
possibilities. First of all we note that the above isotropic form may be written as 


r2 


dr È 
(6.6) ds? = dt? — R(t)? Tea + r'2(40? + sin? 0 dy?)|, 


upon setting 
Re 
(0.7) "= TH+ (Are? 


6 - Supplemento al Nuovo Cimento. 
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(0.4) 
bo 


in the preceding form. The above form is much more convenient for our pur- 
poses, since we shall consider the light rays from the distant galaxies as pro- 
pagating along the lines d0=0, dy—0. The quantity w defined as (dropping 
the prime on r for convenience) 

r 


ds 
(6.8) U =| a — kr}? 


0 


is the proper spatial distance between our galaxy, say, and a galaxy at a co- 


ordinate distance 7 away. 
Note that for kr? sufficiently small we may expand the integral to obtain 


(6.9) uz + =F = CCE 


Thus it is necessary to go to third order to obtain a difference between the 


three cases (k =1, 0, — 1). 
We now set ds=0, to find the time necessary for light, emitted from a 
galaxy at time t, in the past, to propagate to our own: arriving at tf. We have 


t ip 


. 10 2 di Xi "dr he 
(6.10) | R(t) =| (1 = kre) he 


ti 0 


Now, holding the proper distance fixed, consider the variation in the integral 
for light emitted at time f, + At, to be received at time t,+ At, (correspond- 
ing to the emission time Af, at the distant galaxy for light of a given wave 


length) we have 

di Ato INGE 
6.11 A{— == — — = 0) 
ine i RO RG) RG) 


Now since the proper time of the clocks corresponds to their co-ordinate 
time At, oc wave length of light emitted, At, oc to the wavelength received, 
hence 


(6.12) = 


Introducing the notation <= A1/1 we have 


(6.13) ce ent 
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We shall assume R(t,) may be expanded 


12 =. 
ThE 


(6.14) R(t,) Fani Ry th, 


and use this relation together with (6.10) to express Z= f(u, Ry, Ro, Ro) 
which will then relate the red shift to proper distance and R,, Ro, Ry. We 
have 

, 
me, at’ a s 
(6.15) U | Rich, or t= uk, — 3 Morti « 


t 


substituting this value for ¢ in (6.14) and the result in (6.13) we find 
(6.16) mans SEO Ru. 


It is convenient here to restore dimensions using (6.15), then we have ra ch, 
locally, hence 


(6.17) pe Ike Ta I Il (#8 vii yas) ye ET 


ee. ci Re 
neglecting terms of order (r°). The quantity R/R is called Hubble’s con- 
stant H, introduced previously. 

Experimentally one of course does not measure r directly, but as stated 
previously, galactic apparent luminosity / (energy per unit area per unit time) 
which, measured on a logarithmic scale is its apparent magnitude m. To ar- 
rive at an expression for magnitude m, it is convenient to consider the light 
emitted in the form of photons. 

Then we have for the number of photons per unit area N, emitted at 
time t, in the interval $t,, with energy he/A,, and the number per unit area N, 
received at time ¢,, in the interval dt, with energy he/A), 


he 1 
emitted = N, — — 
CIONI 
(6.18) Re 
A APERTE 


But we have 4=4;(1+2), St = dt(14+2) and clearly 


_ 4immiB(t)*eN, 
CV Ager? R(k)20? 
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hence J is given by 


l it 
°° Ager? R(t)? (1+ 2)2 02’ 


(6.19) 


where L, = 1,427? R(t)? @N,(he//,8t,) (energy/time) and is taken to be the same 
for all the galaxies (apart from the Stebbins-Whitford effect) ont he average. 
The magnitude is then expressed by 


(6.20) m = const — 2.5 log, | m = const’ — 5 log,,(1 + 2)r. 


510 0? 


Substituting for 7 one obtains a formula given e.g. by ROBERTSON (*) 


si 
(6.21) m = M,— 45.06 + 5 log 2/H + 1.086 (i a REP — 24 e+ O(e?), 


where M, is the absolute magnitude and the term — 2u has been added which 
allows for changes in magnitude, absorbtion of light in interstellar spaces. 
The quantity —R,/R,H? is the so-called deceleration parameter q. If q were 
large and positive we could say with confidence the expansion is slowing down. 
The work of SANDAGE would imply q= 2.6 +.8. However Baum (**) most 
recently found q & 5 to 1.5. On the other hand Hoyle’s model yields g = —1 
and this is ruled out both by data of BAUM and SANDAGE. 

Without a good knowledge of q it is not possible to select from the various 
models with any certainty although of course the Einstein static model is ruled 
out, and probably the steady-state model as well. 

One curious possibility, frequently considered, is that the universe is closed 
and oscillatory. Under these circumstances, eventually the nearby galaxies 
and the distant galaxies would exhibit a blue shift. Ultimately there would 
be a reuniting of the galaxies into the initial singular state. (It is to be 
emphasized that the singular state need not be a true singularity but actually 
a small space-time region where our approximations break down.) Such a uni- 
verse in a certain sense satisfies the perfect cosmological principle since, on 
the average of many oscillations, it would present the same appearance 
throughout time and, at a given time, would be homogeneous and isotro- 
pic in space. 

However a basic criticism that can be levelled against this model as well 


(*) Jubilee of Relativity Theory, in Helv. Phys. Acta, Supplementum IV (Basel, 
1950) p.125 


(°°) Astron. Journ., 62, 6 (1957). 
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as the steady-state model, and indeed all proposed models, is that they are 
in a sense too mechanistic. On the other hand, the novel phenomena asso- 
ciated with quantum mechanics and life itself, suggest that the structure of 
the universe has a richer and more subtle character than one would infer from 
these highly simplified, albeit ingenious models. Thus the future development 
of the theory must find a unified way of relating cosmology to quantum physics. 


CHAPTER VII. 


SUPPLEMENTARY REMARKS. 


For brevity we have omitted the following topics which would form the 
basis for a more advanced course: 


1) Reissner-Nordstròm solution for a charged particle; structure of the 
energy-momentum tensor of a point particle. 

Systems with axial symmetry. 

Variational principle for the field equations. 


) 
) 
) Definition of an energy-momentum complex for the gravitational field. 
) Gravitational radiation theory. 

) 


Derivation of the Newtonian and « post-Newtonian » equations of 
motion of concentrated sources from the field equations using per- 
turbation theory. 


7) Non-Riemannian geometries: spaces with torsion, Finsler space. 
8) Attempts at a unification of gravitation with electro-magnetism. 
9) Attempts at quantizing general relativity. 

10) Topological considerations. 


It is perhaps interesting to remark in connection with 6) and 9) that when 
EINSTEIN published the first of these investigations (EINSTEIN and GROMMER, 
1927), he was driven in part by a desire to see whether the field equations 
might exhibit or lead to quantum conditions. So far this hope has proved 
unjustified, but it has become increasingly clear in the past several years that 
unless one has such a deeper basis for unifying the two disciplines, one is un- 
likely to succeed. 

A basic difficulty connected with 5) is that although the weak-field gravi- 
tational wave equations have been known for nearly half a century, no such 
waves have ever been observed. Moreover a simple calculation indicates that 
their intensity ought to be fantastically small. Because of this, the subject 
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takes on a very speculative aspect, which becomes all the more so when one 
sets about attempting to quantize these unobserved waves. In addition to 
the mathematical complexity of the field equations, the situation is further 
complicated by the problems connected with 4): one does not have a true 
energy-momentum tensor for the gravitational field but a complex or « pseudo- 
tensor » which in general can be made to vanish locally or along a geodesic 
curve, or in some cases, (e.g. Schwarzschild field in quasi-rectangular proper 
co-ordinates) vanishes everywhere (SCHRODINGER, 1918). This is of course 
totally unlike electromagnetism and implies something very peculiar about 
the concept of « gravitational energy ». 

We defer a more complete discussion of these questions to another occasion. 
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